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I. AIMS AND SCOPE 

These lecture notes introduce some simple effective 
Hamiltonians (also known as semi-empirical models) that 
have widespread applications to solid state and molecular 
systems. They are aimed as an introduction to a begin- 
ning graduate student. I also hope that it may help to 
break down the divide between the physics and chemistry 
literatures. 

After a brief introduction to second quantisation no- 
tation (section |ll]) , which is used extensively, I foc us o f 
the "four H's": the Hiickel (or tight binding; section IH|, 
IV), Heisenberg (section |V I and Hol- 
models. These models p ay central 



Hubbard (section 



stein (section VII 



roles in our understanding of condensed matter physics, 
particularly for materials where electronic correlations 
are important, but are less well known to the chem- 
istry community. Some other related models, such as the 
Pariser-Parr-Pople model, the extended Hubbard model, 
multi-orbital models and the ionic Hubbard model, are 



also discussed in section VI As well as their practical 
applications these models allow us to systematically in- 
vestigate electronic correlations by 'turning on' various 
interactions in the Hamiltonian one at a time. Finally, 
in section VIII[ I discuss the epistemological basis of ef- 
fective Hamiltonians and compare and contrast this ap- 
proach with ab initio methods before discussing the prob- 
lem of the parameterisation of effective Hamiltonians. 

As these notes are intended to be introductory, I will 
not attempt to make frequent comparisons to the latest 
research problems, rather I compare the predictions of 
model Hamiltonians with simple systems chosen for ped- 
agogical reasons. Likewise, references have been chosen 
for their pedagogical and historical value rather than on 
the basis of scientific priority. 

Given the similarity in the problems addressed by 
theoretical chemistry and theoretical condensed matter 
physics, there are surprisingly few advanced texts dis- 
cussing the interface of two subjects. This, unfortu- 
nately, leads to many cultural differences between the 
fields. Nevertheless, some textbooks do try to bridge the 
gap, and the reader in search of more than the introduc- 
tory material presented here is referred to Refs. Jj and 

m 



II. A BRIEF INTRODUCTION TO SECOND 
QUANTISATION NOTATION 

The models discussed in these notes are easiest to un- 
derstand if one employs the so-called second quantisation 
formalism. In this section we briefly and informally in- 
troduce this formalism. More details can be found in 
many textbooks, (e.g. Refs. IHlandU]). Readers already 
familiar with this notation may wish to skip this section, 
however, the last two paragraphs do define some nomen- 
clature that is used throughout these notes. 



A. The simple harmonic oscillator 

Let us begin by considering a particle of mass m mov- 
ing in a one-dimensional harmonic potential: 
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This may be familiar as the potential of an ideal spring 
displaced from its equilibrium position by a distance x, in 
which context k is known as the spring constant ([5]). Eq. 
[T]is also the potential felt by an atom as it is displaced (by 
a small amount) from its equilibrium position in molecule 
Classically this problem is straightforward to solve 
(l5| and, as well as the trivial solution, one finds that 
the particle may oscillate with a resonant frequency uj — 
^Jkjm. The time-independent Schrodinger equation for 
a simple harmonic oscillator is therefore, 
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is the 



^ is the particle's momentum and ipn 
n**^ wavefunction or eigenfunction, which has energy, or 
eigenvalue, En- 

This problem is solved in many introductory texts on 
quantum mechanics Q using the standard methods of 
'first quantised' quantum mechanics. However, a more 
elegant way to solve this problem is to introduce the 'lad- 
der operator'. 
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One of the most important features of quantum me- 
chanics is that momentum and position do not commute 
(O, i.e., [p, x]=px — xp= —ih. From this commutation 
relation it is straightforward to show that 



At; 



1 



and 



fa, a^l = aa^ — a^a = 1. 



(4) 



(5) 



One can also show that [Hsho,o] = [huj{a^d + ^),a] = 
h(jj[a\d]a = —hujci in a similar manner. Therefore 
["Hsho, aj'/'ri = —hijjaij^n and hence 



(6) 



Eq. [g] tells us that atpn is an eigenstate of Usho with 
energy En — huj, provided a^n 7^ 0. That is, the operator 
a moves the system from one eigenstate to another whose 
energy is lower by hiu, thus a is known as the lowering or 
destruction operator. 

Note that for any wavefunction, (j>, (0|j5^|^) > and 
((^|f2|<^) > 0. Therefore, it follows from Eq. |2]that En > 
for all n. Hence, there is a lowest energy state, or 
ground state, which we will denote as V'o- Therefore there 
is a limit to how often we can keep lowering the energy of 
the state, i.e., atpo = 0. We can now calculate the ground 
state energy of the harmonic oscillator. 



'HshoV-'o ^ huj { d^d + -] iJjQ 
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(7) 



In the same way as we derived Eq. [6j one can easily 
show that "HshoS^V'n — {En + hijj)d^ipn- Therefore 
moves us up the ladder of states that d moved us down. 
Hence a) is known as a raising or creation operator. Thus 
we have 



d^ipn = Vn + l?A„+i, 



(8a) 
(8b) 



where the terms inside the radicals are required for the 
correct normalisation of the wavefunctions (8J • Therefore 
V'n = v=f {a^Ti^o and 



En ~ huj n 



1 



(9) 



B. Second quantisation for light and matter 

We can extend the second quantisation formalism to 
light and matter. Let us first consider bosons, which 
are not subject to the Pauli exclusion principle, e.g., 
phonons, photons, deuterium nuclei, *He atoms, etc. We 
define the bosonic 'field operator' W{y) as creating a bo- 
son at position r, similarly, 6(r) annihilates a boson at 
position r. The bosonic field operators obey the commu- 
tation relations [6(r),S(r')] = 0, p {r),}}^ {v')] = 0, and 



[S(r),fot(r')] =5(r-r') 



(10) 



This is just the generalisation of Eq. |5] for the field op- 
erators. We can create any state by acting products, or 
sums of products, of the &^(r) on the vacuum state, i.e., 
the state that does not contain any bosons, which is usu- 
ally denoted as |0). 

Many body wavefunctions for fermions, e.g. electrons, 
protons, neutrons, "^He atoms, etc., are complicated by 
the need for the antisymmetrisation of the wavefunction, 
i.e., the wavefunction must change sign under the ex- 
change of any two fermions. Therefore, if we introduce 
the fermionic field operators ^/;^(r) and "0(1"), which, re- 
spectively, create and annihilate fermions at position r, 
we must make sure that any wavefunction that we can 
make by acting some set of these operators on the vac- 
uum state is properly antisymmetrised. This is ensured 
(j9|) if one insists that the field operators anti-commute, 
i.e., if 



{?/'(r),VHr')} = V-Wr (!■') + rCi-OV-W ='5(r-r'), 

(11a) 
(lib) 
(11c) 



{i^{v),i>{v')} = 0, 
{^t(r)>^(r')} = 0. 



This guarantee of an antisymmetrised wavefunction is 
one of the most obvious advantages of the second quanti- 
sation formalism as it is much easier than having to deal 
with the Slater determinants that are typically used to 
ensure the antisymmetrisation of the many-body wave- 
function in the first quantised formalism (3). 

For any practical calculation one needs to work with a 
particular basis set, {0i(r)}. The field operators can be 
expanded in an arbitrary basis set as 



= ^Ci0,(r), 

i 



(12a) 
(12b) 



Notice that above we solved the simple harmonic oscil- 
lator, i.e., calculated the energies of all of the eigenstates, 
without needing to find explicit expressions for any of the 
first quantised eigenfunctions, tpn- This general feature 
of the second quantised approach is extremely advanta- 
geous when we are dealing with the complex many-body 
wavefunctions typical in condensed matter physics and 
chemistry. 



Thus (fp annihilates (creates) a fermion in the state 
(f>i(r). These operators also obey fermionic anticommu- 
tation relations. 



{c^,Cj} = 0, 

{£lc]} = 0. 



(13a) 
(13b) 
(13c) 
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As fermions obey the Pauli exclusion principle there 
can be at most one fermion in a given state. We will 
denote a state in which the i*'^ basis function contains 
zero (one) particles by |0,;) (|lj))- Therefore 



where 



= 
= IL) 



C^\l^) = m 

0. 



cUu 



(14) 



It is important to realise that the number is very dif- 
ferent from the state |0i). 

Any operator acting on a system of fermions can be 
expressed in terms of the c operators. A particularly 
important example is the 'number operator', fii = c|ci, 
which simply counts the number of particles in the state 
i - as can be confirmed by explicit calculation from 
Eqs. [T4j The total number of particles in the system 
is therefore simply the expectation value of the operator 
N = ^ • fii = ^ - clci. Importantly, because we can write 
any operator in terms of the c operators, we can calcu- 
late any observable from the expectation value of some 
set of c operators. Thus we have access to a complete 
description of the system from the second quantisation 
formalism. Further, we can always write the wave func- 
tion in terms the c operators if an explicit description 
of the wavefunction is required. For example the sum of 
Slater determinants, 



*(ri,r2) = a 



<^i(ri) 02(ri) 
</'i(r2) 02(r2) 



hiri) 04 (ri) 
h{i^2) <?i'4(r2) 



(15) 



2m 



U{v) 



0,(r) (19) 



Vijki ~ 



dhi I d^V2<P*{v^)(j),{vi) (20) 
xF(ri -r2)0fe(r2)0i(r2), 



and the labels i, j, k and I are taken to define the spin 
as well as the basis function. This is exact provided we 
have an infinite complete basis. But practical calcula- 
tions require the use of finite basis sets and often use 
incomplete basis sets. The simplest approach is to just 
ignore this problem and calculate t.^ and Vij^i directly 
from the finite basis set. However, this is often not the 
best approach. We will delay a detailed discussion of why 
this is and of the deep philosophical issues raised by this 
until section [VIII| We also delay discussion of how to cal- 
culate these parameters until section [VIII| Until then we 
will simply assume that tij, Vijki and other similar pa- 
rameters required are known and instead focus on how to 
perform practical calculations using models of the form 
of Eq. [18] and closely related Hamiltonians. 

In what follows we will assume that the states created 
by the c| operators form an orthonormal basis. This 
greatly simplifies the mathematics, but differs from the 
approach usually taken in introductory chemistry text- 
books as most quantum chemical calculations are per- 
formed in non-orthogonal bases for reasons of computa- 
tional expedience. 



describes the same state as 



1^-) = (aciC2-f /3c3C4)|0), 



(16) 



where |0) = |0i, O2, O3, O4, . . . ) is the vacuum state, as 
^'(ri,r2) = (ri,r2|*) (cf. Ref. 18^. 

Often, in order to describe sohd state and chemical 
systems, one needs to describe a set of electrons whose 
behaviour is governed by a Hamiltonian of the form 



N 



2m 



C/(r„ 



,(17) 



where F (r„ — r^) is the potential describing the interac- 
tions between electrons and U{ri) is an external potential 
(including interactions with ions or nuclei, which may of- 
ten be considered to be stationary on the time scales 
relevant to electronic processes - although we will discuss 
effects due to the displacement of the nuclei in section 
VII I . In terms of our second quantisation operators this 



Hamiltonian may be written as 



I'^VijkiclclciCj, (18) 



III. THE HUCKEL OR TIGHT-BINDING MODEL 

The simplest model with the form of Eq. [18] is usual 
called the Hiickel model in the context of molecular sys- 
tems ([To]) and the tight-binding model in the context of 
crystals fTT]) . In these models one makes the approxi- 
mation that Vijki = for all i, j, k, and I. Therefore 
these models explicitly neglect the interactions between 
electrons. Both models are identical, but slightly differ- 
ent notation is standard in the different contexts. We 
assume that our basis set consists of orbit als centred on 
particular sites, as we will in all of the models considered 
in these notes. These sites might be, for example, atoms 
in a molecule or solid, chemical groups in a molecule, p-d 
hybrid states in a transition metal oxide, entire molecules 
in a molecular crystal, or even larger strucutures. Clearly 
the simplest problem has only one orbital per spin state 
on each site, in which case. 



(21) 



ijkl 



where c^^^ annihilates (creates) an electron with spin a 
in an orbital centred on site i. 
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A. Molecules (the Hiickei model) 



The standard notation in this context is ta — —ai, 
j — if site i and site j are connected by a chemical 



bond; one assumes that = otherwise. Note that the 
subscripts on a and /3 are also often dropped, but they 
are usually implicit; if the molecule contains more than 
one species of atom the as will clearly be different on the 
different species and the /3s will depend on the species of 
each of the atoms that the electron is hopping between. 
Therefore, 

"HHiickcl = '^CticlcCia + ^ /^JjcJffCjo-, (22) 
icr (ij)a 

where {ij) serves to remind us that the sum is only over 
those pairs of atoms joined by a chemical bond. Note 
that j3ij is typically negative. 



1. Molecular hydrogen 
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FIG. 1 The energy levels of the atomic and molecular or- 
bitals in the Hiickei description of H2. The bonding orbital 
is lower in energy than the atomic orbital, whereas the 
antibonding orbital is |/3| higher in energy than the atomic 
orbital. Therefore, neutral H2 is stabilised by 2|/3| relative to 
2H. 



Clearly, in H2 there is only a single atomic species. In 
this case one can set ai = a for all i without loss of 
generality. Further, as there is also only a single bond, 
we may also choose Pij — /3 giving 

a- a 

(23) 

where we have labelled the two atomic sites 1 and 2. 
This Hamiltonian has two eigenstates: one is known as 
the bonding state, 

1^6.) = ^(4 + 4)10), (24) 

and the other is known as antibonding state, 

\^aa) = ^icl-cl)\0). (25) 

The bonding state has energy a + /3, whereas the an- 
tibonding state has energy a — P, recall that /? < 0. 
Therefore every electron in the bonding state stabilises 
the molecule by an amount |/?|, whereas electrons in the 
antibonding state destabilise the molecule by an amount 
|/3|, hence the nomenclature.^ This is sketched in Fig. [l] 
Because Vijki = the electrons are non-interacting and 
so the molecular orbitals are not dependent on the occu- 
pation of other orbitals. Therefore to calculate the total 
energy of the ground state of the molecule one simply fills 
up the states starting with the lowest energy states and 



Note that in a non-orthogonal basis the antibonding orbital may 
be destabilised by a greater amount than the bonding orbital is 
stabilised. 



respecting the Pauli exclusion principle. If the two pro- 
tons are infinitely separated /3 — and the system has 
total energy Na, where TV is the total number of elec- 
trons, has only one electron, which, in the ground 
state, will occupy the bonding orbital, and so has a 
binding energy of /3. H2 has two electrons; in the ground 
state these electrons have opposite spin and therefore can 
both occupy the bonding orbital. Thus H2 has a binding 
energy of 2/3. has three electrons, so while two can 
occupy the bonding state one must be in the antibond- 
ing state, therefore the binding energy is only /3. Finally, 
Hj" has four electrons so one finds two in the each molec- 
ular orbital. Therefore the bonding energy is zero: the 
molecule is predicted to be unstable. 

Thus the Hiickei model makes several predictions: neu- 
tral H2 is predicted to be significantly more stable than 
any of the ionic states; the two singly ionic species are 
predicted to be equally stable; the doubly cationic species 
is predicted to be unstable. Further, the lowest optical 
absorption is expected to correspond to the transition be- 
tween the bonding orbital and the antibonding orbital. 
The energy gap for this transition is 2|/3|. Therefore, the 
lowest optical absorption is predicted to be the same in 
the neutral species and the singly cationic species. Fur- 
ther, this absorption is predicted to occur at a frequency 
with the same energy as the heat of formation for the 
neutral species. While these predictions do capture qual- 
itatively what is observed experimentally, they are cer- 
tainly not within chemical accuracy (i.e. within ksT ^ 1 
kcal mol-i ~ 0.03 eV for T = 300 K). For example the 
experimentally determined binding energies (jlOp are 2.27 
eV for H+, 4.74 eV for H2, 1.7 eV for while Hj- is 
indeed unstable. 
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FIG. 2 The molecular orbitals for benzene from vr-Huckel the- 
ory. Different colours indicate a change in sign of the wave- 
function. In the neutral molecule the A.2u and both Eig states 
are occupied, while the B2g and E2u states are virtual. Note 
that we have taken real superpositions (|10[) of the two-fold 
degenerate states in order to facilitate these plots. 

2. TT-Hijckel theory of benzene 

For many organic molecules a model known as tt- 
Hiickel theory is very useful. In Tr-Hiickel theory one 
considers only the tt electrons. A simple example is a 
benzene molecule. The hydrogen atoms have no tt elec- 
trons and are therefore not represented in the model. 
This leaves only the carbon atoms, so again we can set 
ai = a and Pij = j3. Because of the ring geometry of 
benzene (and assuming that the molecule is planar) the 
Hamiltonian becomes 

■HHiickei = a'^h^^ + {c\^CiJ^i„ + 4+1,7 Cm) , (26) 

where the addition in the site index is defined modulo six, 
i.e., site number seven is site number one. For benzene 
we have six solutions per spin state, which are 

iV'A.j = ^ (cL + cL + cL + 4 + C + cL) |o), 

= ^ (cL + ecL + e'cL - cL - eel, " |0), 

= ^ (cL - - 4a - + e4 + 4) lo), 

= ^ (cL + e'cL - 4a + cL + e'4a - 4a) |0), 
= \ (cla - e4a + e'4a + 4a ^ e4a + e'4a) |0) 



and 

IV'B.J = ^ (4a - 4a + 4a " 4a + 4a " 4a) |0), 

where e = e*'^/'^. These wavefunctions are sketched in 
Fig. [2] The energies of these states are — ol — 2\I3\^ 

Ee,, - i?^,, - « - |/3|, = ="a + and 

EB2g = a + 2\P\. The subscripts are symmetry labels 
(HU [131) for the group De/i and one should recall that, 
because we are dealing with 7r-orbitals, all of the orbitals 
sketched here are antisymmetric under reflection through 
the plane of the page. The degenerate (Eig and E2u) 
orbitals are typically written/drawn rather differently (cf. 
Ref. nop . However, any linear combination of degenerate 
eigenstates is also an eigenstate; this representation was 
chosen as it highlights the symmetry of the problem. For 
a more detailed discussion of this problem see Ref. 1141 

3. Electronic interactions and the parameterisation of the 
Hijckel model 

As noted above the Hiickel model does not explicitly 
include interactions between electrons. This leads to se- 
rious qualitative and quantitative failures of the model, 
some of which we have seen above and which will discuss 
further below. However, given the (mathematical and 
conceptual) simplicity and the computational economy 
of the method one would like to improve the method 
as far as possible. So far we have treated the theory 
as parameter free. However, if we treat the model as a 
semi-empirical method instead one can include some of 
the effects due to electron-electron interactions without 
greatly increasing the computational cost of the method. 
For example, one can make a dependent on the charge 
on the atom. This is reasonable, as the more electrons 
we put on an atom the harder it is to add another due 
to the additional Coulomb repulsion from the extra elec- 
trons. The simplest way to account for this is the 'w 
technique' (jTUJ) where one replaces 

tti a- = at + uj{qo - gi)/3, (28) 

where qt is the charge on atom i, qq is a (fixed) reference 
charge and is a parameter. The uj technique suppresses 
the unphysical fluctuations of the electron density, which 
are often predicted by the Hiickel model (cf. the dis- 
cussion of H2 above). Similar techniques can also be 
applied to /3. These parameterisations only slightly com- 
plicate the model and do not lead to a major inflation 
of the computational cost, but can significantly improve 
the accuracy of the predictions of the Hiickel model (|T?|) . 

B. Crystals (the tight binding model) 

For infinite systems it is necessary to work with a fixed 
chemical potential rather than a fixed particle number. 
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Therefore before we discuss the tight binding model we 
will briefly review the chemical potential (also see Ref. [6] 
for a discussion of the chemical potential in a chemical 
context). 



1. The chemical potential 

When one is dealing with a large system keeping track 
of the number of particles can become difficult. This 
is particularly true in the thermodynamic limit where 
the number of electrons = (N) — > oo and the vol- 
ume of the system — )■ oo in such a way so as to en- 
sure that the electronic density, Ue — N^/V, remains 
constant. Lagrange multipliers (16) are a powerful and 
general method for imposing constraints on differential 
equations (such as the Schrodinger equation) without 
requiring the solution of integro-differential equations. 
Briefly, consider a function, f{x,y,z,...) that we wish 
to extremise (minimise or maximise) subject to a con- 
straint that means that x, y, z . . . are no longer inde- 
pendent. In general we may write the constraint in the 
form 4>{x,y, z, . . .) = 0. This allows us to define the 
function g(a;, y, z . . . , A) = /(a;, y,z...) + X(j){x, y,z...), 
where A is known as a Lagrange multiplier. One may 
show ()16p that the extremum of g{x, y, z . . . , A) with re- 
spect to X, y, z . . . and A is the extremum of f{x, y, z . . .) 
with respect to x, y, z . . . subject to the constraint that 
(j){x,y,z,...) =0. 

Typically the problem we wish to solve in chemistry 
and condensed matter physics is to minimise the free en- 
ergy, F, (which reduces to the energy, E, a.t T = 0) 
subject to the constraint of having a fixed number of 
electrons (determined by the chemistry of the material 
in question). This suggests that one should simply in- 
troduce a Lagrange multiplier to resolve the difficulty of 
constraining the number of electrons in the thermody- 
namic limit. A suitable constraint could be introduced 
by adding the term A(A"o — A^) to the Hamiltonian, where 
A^o is the chemically required number of electrons, and 
requiring the the free energy is an extremum with respect 
to A. However, one can also impose the same constraint 
and achieve additional physical insight by subtracting the 
term fj,N from the Hamiltonian and requiring that 



A^o 



dF 



(29) 



The chemical potential (for electrons), fi, is then given 
by 



dF 



(30) 



Therefore, specifying a system's chemical potential is 
equivalent to specifying the number of electrons, but pro- 
vides a far more powerful approach for bulk systems. 

Physically this approach is equivalent to thinking of 
the system as being attached to to an infinite bath of 



electrons, i.e., one is working in the grand canonical en- 
semble (fT7|) . Thus, the Fermi distribution for the system 
is given by 



fiE,T) 



1 



(31) 



Therefore at T = all of the states with energies lower 
than the chemical potential are occupied, and all of 
the states with energies greater than the chemical po- 
tential are unoccupied. Therefore, the Fermi energy, 
Ep — /x(T = 0). Note that as F is temperature depen- 
dent Eq. [30] shows that, in general, /i will also be tem- 
perature dependent.^ Nevertheless Eq. 31 gives a clear 



interpretation of the chemical potential at any tempera- 
ture: n{T) is the energy of a state with a 50% probability 
of occupation at temperature T. 



2. The tight binding model 

For periodic systems (crystals) one usually refers to the 
Hiickel model as the tight binding model. Often one con- 
siders models with only 'nearest neighbour' terms, that 
is one takes ta — — e^, tij = t ii i and j are at nearest 
neighbour sites, and tij = otherwise. Thus, for nearest 
neighbour hopping only, 

Hth - nN ^ ~tY^ cl^Cja + ^(e» - m)cLc*<t, (32) 

(ij)cr iff 

where fj, is the chemical potential and (ij) indicates that 
the sum is over nearest neighbours only. Further, if we 
consider materials with only a single atomic species we 
can set = yielding 



n 



tb 



(33) 



3. The one dimensional chain 

The simplest infinite system is a chain with nearest 
neighbour hopping only. As we are on a chain the sites 
have a natural ordering and the Hamiltonian may be 
written as 



(34) 

We can solve this model exactly by performing a lattice 
Fourier transform. We begin by introducing the recipro- 



^ In contrast, as Ep is only defined at T = it is not temperature 
dependent. 
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FIG. 3 (a) The dispersion relation, Sk = — 2t cos(fca), of the one dimensional tight binding chain with nearest neighbour hopping 
only, (b) Shaded area shows the filled states for = —t. (c) Shaded area shows the filled states for = 0. (d) Shaded area 
shows the filled states for /j, = t. 



cal space creation and annihilation operators: 



ikRi 



N 

rrf / y fecr 



(35a) 
(35b) 



where k is the lattice wavenumber or crystal momentum 
and Ri is the position of the i**^ lattice site. Therefore, 



ikk' a 



J(k'-k)Ri 



— ika 



where a is the lattice constant, i.e., the distance between 
neighbouring sites Ri and Ri+i- J2i s*^*^ — 5{k' — 

k) (Hll); therefore 

iith - fiN = E [ ^ COs(fca) cl^Cka - fJ-cl^Ckcr 
k(T 



where Sk = —2tcos{ka) is known as the dispersion rela- 
tion. Notice that Eq. ISTlis diagonal, i.e., it only depends 
on the number operator terms, Uka — c^^Cka- Therefore 



the energy is just the sum of Sk for the states ka that 
are occupied, and we have solved the problem. We plot 
the dispersion relation in Fig. [3^. For a tight binding 
model calculating the dispersion relation is equivalent to 
solving the problem. 

The chemical potential, /i, must be chosen to ensure 
that there are the physically required number of elec- 
trons. Changing the chemical potential has the effect of 
moving the Fermi energy up or down the band and hence 
changing the number of electrons in the system. For ex- 
ample (see Fig. [Sja-d), in the above problem the half 
filled band corresponds to /i = 0; the quarter filled band 
corresponds to ^ = —t; and the three quarters filled band 
corresponds to fi = t. 



4. The square, cubic and hypercubic lattices 

In more than one dimension the notation becomes 
slightly more complicated, but the mathematics does not, 
necessarily, become any more difficult. The simplest gen- 
eralisation of the chain we have solved above is the two 
dimensional square lattice where 



n 



tb 



(38) 
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Recall that (ij) indicates that the sum is over nearest 
neighbours only. To solve this problem we simply gener- 
alise our reciprocal lattice operators to 



1 



k 



(39a) 
(39b) 



where k — {k^,ky) is the lattice wavevector or crystal 
momentum and = {xi,yi) is the position of the i**^ 
lattice site. We then simply repeat the process we used 
to solve the one dimensional chain. As the lattice only 
contains bonds in perpendicular directions the calcula- 
tions for the X and y directions go through independently 
and one finds that 



n 



tb 



/iiV-^(£k 



Ai)cLck<T, 



(40) 



k(T 



where the dispersion relation is now £k = 
—2t[cos{kxax) + cos^kytty)] and is the lattice 

constants in the v direction. 

A three dimensional cubic lattice is not any more dif- 
ficult. In this case k = {kx,ky,kz) and the solution is 
of the form of Eq. 40 but with = —'2.t\cos(kxax) + 
cos{kyay) + cos(fczaz)]7^ Indeed so long as we keep all the 
bonds mutually perpendicular one can keep generalising 
this solution to higher dimensions. This may sound some- 
what academic as no materials live in more than three 
dimensions, but the infinite dimensional hypercubic lat- 
tice has become important in recent years because many 



models that include interactions can be solved exactly in 
infinite dimensions as we will discuss in section IIV.D.2I 



5. The hexagonal and honeycomb lattices 

Even if the bonds are not all mutually perpendicu- 
lar the solution to the tight-binding model can still be 
found by Fourier transforming the Hamiltonian. Three 
important examples of such lattices are the hexagonal 
lattice (which is often referred to as the triangular lat- 
tice, although this is formally incorrect), the anisotropic 
triangular lattice, and the honeycomb lattice, which are 
sketched in Fig. |4] For each lattice the solution is of the 
form of Eq. Hol For the hexagonal lattice 



Ek = —2tcos{kxax) — 4<cos I —^kyUy 1 cos 



For the anisotropic triangular lattice 



kxCL:r 



(41) 



£k = cos(fc^aa;) H- cos(fcj^aj^) — 2t' cos{kj.ax + kyUy) . 

. .^^^^ 

The honeycomb lattice has an important additional 
subtlety, that there are two inequivalent types of lattice 
site (cf. Fig. |4j;) , which it is worthwhile to work through. 
We begin by introducing new operators, Ciya, which an- 
nihilate an electron with spin a on the v^^ sublattice in 
the i"^ unit cell, where v — A or B. Therefore we can 
rewrite Eq. [38] as 



-'El- 



1 

1 



CiBa 



ko- 



CkAcr 
CkBcr 



\ / CkAfT 

K ^ J \ CkBo- 



where /ik = e'''-" + e'^i^'^+^^y'^"-/'^ + e-*('=--\/3'^y)°/2. Therefore 



Ek = ±t\hiA=±t 



\ 



3 + 2cos(V3fc,ya) + 4cos (^^^b^ 



(43) 



3 A/'p Qj 



(44) 



We plot this dispersion relation in Fig. [Sj 

The most interesting features of this band structure 
are called the 'Dirac points'. The Dirac points are lo- 
cated at k = nK -I- mK' where n and m are integers, 
K = (27r/3a,27r/3\/3a), and K' = (27r/3a, -27r/3\/3a). 



In order to see why these points are interesting consider 
a point K 4- q in the neighbourhood of K. Recalling that 
cos{K + (?) = cos K ~ q sin K + ^q^ cos K + . . . one finds 
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FIG. 4 The (a) hexagonal (triangular), (b) anisotropic triangular, (c) honeycomb and (d) kagome lattices. The hexagonal 
lattice contains two inequivalent types of lattice site, some of which are labelled A and B. The sets of equivalent sites are 
referred to as sublattices. 




FIG. 5 The Dirac dispersion of the honeycomb lattice 

that, for small |q|, 

£K+q = ^wfIqI + ■ • ■ (45) 

where vp = 3ta/2h is known as the Fermi velocity. 

This result should be compared with the relativistic 
result 

El = m^c^ + h^c^\k\\ (46) 

where m is a particles rest mass and c is the speed of 
light. This reduces to the famous E = m(? for A; = 0, 
but for massless particles, such as photons, one finds that 



-E'k = fi-c|k|. Thus the low-energy electronic excitations 
on a honeycomb lattice behave as if they are massless 
relativistic particles, with the Fermi velocity playing the 
role of the speed of light in the theory. Therefore much 
excitement has been caused by the recent synthe- 
sis of atomically thick sheets of graphene PO)) . in which 
carbon atoms form a honeycomb lattice. In graphene 
vp — ^ 'X 10^ ms^^, two orders smaller than the speed 
of light in the vacuum. This has opened the possibility 
of exploring and controlling 'relativistic' effects in a solid 
state system (fTn|) . 



IV. THE HUBBARD MODEL 

So far we have neglected electron-electron interactions. 
In real materials the electrons repel each other due to the 
Coulomb interaction between them. The most obvious 
extension to the tight binding model that describes some 
of the electron-electron interactions is to allow only on- 
site interactions, i.e., if Vijki ^ if and only if i, j, k and 
I all refer to the same orbital. For one orbital per site we 
then have the Hubbard model, 

{ij)a i 

where we have assumed nearest neighbour hopping only. 
It follows from Eq. [21] that U > 0, i.e., electrons repel 
one another. 
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A. The two site Hubbard model: molecular hydrogen 

The two site Hubbard model is a nice context in which 
to consider some of the basic properties of the chemical 
bond. The two body term in the Hubbard model greatly 
complicates the problem relative to tight binding model. 
Therefore the Hubbard model also presents a nice context 
in which to introduce one of the most important tools in 
theoretical physics and chemistry: mean-field theory. 



n + 6n, where n (m) is the mean value of n (m) and 
dn (6m) are the fluctuations about the mean, which are 
assumed to be small, one notes that 

mn — {fn + 5'm){n + 6n) 

= fnn + fnSn + Smn + SmSn 

w fnn + fiTdn + Smn. (48) 



1. Mean-field theory, the Hartree-Fock approximation & 
molecular orbital theory 

To construct a mean-field theory of any two, as yet 
unspecified, physical quantities, m — m + dm and n = 



Thus mean-field approximations neglect terms that are 
quadratic in the fluctuations. 

Hartree theory is a mean-field in the electron density, 
i.e.. 



(49) 



However, it was quickly realised that this does not allow for electron exchange, i.e., one should also include averages 
such as (cJjC^), therefore a better mean-field theory is Hartree-Fock theory, which includes these terms. However, 
because of the limited interactions included in the Hubbard model the Hartree theory is identical to the Hartree-Fock 
theory if one assumes that spin-fiip terms are negligible, i.e., that (cl^Cii) = 0, which we will. 
The Hartree-Fock approximation to the Hubbard Hamiltonian is therefore 



^HF = —t ^ cLc 



(50) 



where rii^ = {c\^Cia) . Thus we have a Hamiltonian for a single electron moving in the mean-field of the other electrons. 
Note that this Hamiltonian is equivalent to the a; method parameterisation of the Hiickel model (cf. section HI. A. 3 



and particularly Eq. 28 1 if we set uj — U / (i. Thus the u method is just a parameterisation of the Hubbard model 
solved in the Hartree-Fock approximation. 

The Hubbard model with two sites and two electrons can be taken as a model for molecular hydrogen. In the 
Hartree-Fock ground state, I^'hf); ^'^^ electrons have opposite spin and each occupy the bonding state, which we 
found to be the ground state of the Hiickel model in section |III.A.1[ 



I*°hf) 



1 



-2t 



2 (^t'^U 



4i4t + 4t4i 



%)|0) 

)|o). 



(51a) 
(51b) 



Notice that I^'hf) is just a product of two single particle 
wavefunctions (one for the spin up electron and another 
for the spin down electron; cf. Eq. 51a). Thus we say 



that the wavefunction is uncorrelated and that the two 
electrons are unentangled. 

An important prediction of the Hartree- Fock theory is 
that if we pull the protons apart we are equally likely 
to get two hydrogen atoms (H-l-H) or two hydrogen ions 



(H+-I-H ). This is not what is observed experimentally. 
In reality the former is far more likely. 



2. The Heitler-London wavefunction &i valence bond theory 

Just a year after Schrodinger wrote down his wave 
equation (PT|) , Heitler and London (^2^ proposed a theory 
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of the chemical bond based on the new quantum mechan- 
ics. Explaining the nature of the chemical bond remains 
one of the greatest achievement of quantum mechanics. 
Heitler and London's theory led to the valence bond the- 
ory of the chemical bond PSI). The two site Hubbard 
model of H2 is the simplest context in which to study 
this theory. 

The Heitler-London wavefunction is 



1 

71 



(52) 



Notice that the wavefunction is correlated as it cannot 
be written as a product of a wavefunction for each of the 
particles. Equivalently one can say that the two electrons 
are entangled. The Heitler London wavefunction overcor- 
rects the physical errors in the Hartree-Fock molecular 
orbital wavefunction as it predicts zero probability of H2 
dissociating to an ionic state, but is, nevertheless, a sig- 
nificant improvement on molecular orbital theory. 



3. Exact solution of the two site Hubbard model 

The Hilbert space of the two site, two electron Hubbard 
model is sufficiently small that we can solve it analyti- 
cally; nevertheless this problem can be greatly simplified 
by using the symmetry properties of the Hamiltonian. 
Firstly, note that the total spin operator commutes with 
the Hamiltonian |47j as none of the terms in the Hamilto- 
nian cause spin flips. Therefore the energy eigenstates 
must also be spin eigenstates. For two electrons this 
means that all of the eigenstates will either be singlets 
{S = 0) or triplets {S = l). 

Let us begin with the triplet states, |^™)- Con- 
sider a state with two spin up electrons, l^*}). Be- 
cause there is only one orbital per site the Pauli ex- 
clusion principal ensures that there will be exactly one 



electron per site, i.e., I^"}) = cJ^Cj^lO). The electrons 
cannot hop between sites as the presence of the other 
electron and the Pauli principle forbid it. Therefore, 
(*}|(-fcLc2.)|*}) = {^\\{-tcth,)\^\) = for fi =t 
or 4,. There is exactly one electron on each site so 
{'^\\UT.^c\^^^Ai^^i\^\) = 0' Thus the total energy of 
this state \s E\ — Q. 

The same chain of reasoning shows that = 
|0) and E^'^ =0. It then follows from spin ro- 



4^ "-2^ 

tation symmetry that l^*") = '^{'^\-\'^2i + ^u^2t)l^) ^^'^ 

As the Hilbert space contains six states, this leaves 
three singlet states. A convenient basis of these is formed 
by the Heitler-London state and the two charge trans- 
fer states: |*hl) = :^{c\Al ' ^u4t)|0)' l*ct+) = 
775(4^1+44)10) and 1*,^.) = ^(5lt4i-44)|0)- 



symmetry,^ which swaps the site labels 1 O 2, whereas 
l^'ct-) is odd under inversion symmetry. As the Hamil- 
tonian is symmetric under inversion the eigenstates will 
have a definite parity so l^'ct-) is an eigenstate, with en- 
ergy Ect- = U. The other two singlet states are not 
distinguished by any symmetry of the Hamiltonian and 
so they do couple, yielding the Hamiltonian matrix 



n 



(^HLl'HHubbardl^HL) 
(*ct+|'HHubbard|1'HL) 

-2t 

-2t U 



HLl^Hubbardl ^ct-l-/ 
^'ct+|'HHubbard|*ct+) 



(53) 



This has eigenvalues, Eqf — ^{U — ^/lP~^\^161^) and 
-^s^ ~ + Vt/^ -|- 16t'^). The corresponding eigen- 
states are 



I*cf) 



cos6I|*hl) +sin6i|«'ct+ 
cos 61 / ^ 

^It 2^ ^1^^2t 



V2 



sin 6 



V2 

sin6i|*HL) +cos6'|«'ct^ 
sin6> /4 4 _ 4 4 

^2 V'^1T"^2; ^ll^2t 



^i^t ~'~ 4^4 



|0) (54a) 



^/2 



|0)(54b) 



where tan 6* = {U - VW^ + m'^)/4t. For [/ > 0, as is 
required physically, the state |4'cf) is the ground state 
for all values of U/t. |5'cf) is often called the Coulson- 
Fischer wavefunction. 

Inspection of Eq. 54a reveals that for U/t ^ 00 the 



Coulson-Fischer state tends to the Heitler-London wave- 
function, while for U/t ^ we regain the molecular or- 
bital picture (Hartree-Fock wavefunction). 



B. Mott insulators & the Mott-Hubbard metal-insulator 
transition 



In 1949 Mott p4|) asked an apparently simple ques- 
tion with a profound and surprising answer. As we 
have seen above, for the two site Hubbard model both 
the molecular orbital (Hartree-Fock) and valence bond 
(Heitler-London) wavefunctions are just approximations 
to the exact (Coulson-Fischer) wavefunction. Mott asked 
whether the equivalent statement is true in an infinite 



Note that I^'hl) and |^ct+) are even under 'inversion' 



^ It may not be immediately obvious that I^'hl) is even under 
inversion symmetry, but this is easily confirmed as II'J'hl) = 

75(44-44)1°) = 4(-44 + 44)10) = i-i-hl}, 

where X is the inversion operator, wrhich swaps the labels 1 and 
2. 



11 



solid, and, surprisingly, found that the answer is no. Fur- 
ther, Mott showed that the Hartree-Fock and Heitler- 
London wavefunctions predict very different properties 
for crystals. 

One of the most important properties of a crystal is 
its conductivity. In a metal the conductivity is high 
and increases as the temperature is lowered. Whereas 
in a semiconductor or an insulator the conductivity is 
low and decreases as the temperature is lowered. These 
behaviours arise because of fundamental differences be- 
tween the electronic structures of metals and semiconduc- 
tors/insulators ((TT|) . In metals there are excited states 
at arbitrarily low energies above the Fermi energy. This 
means that, even at the lowest temperatures, electrons 
can move in response to an applied electric field. In semi- 
conductors and insulators there is an energy gap between 
the highest occupied electronic state and the lowest unoc- 
cupied electronic state at zero temperature. This means 
that a thermal activation energy must be provided if elec- 
trons are to move in response to an applied field. The 
difference between semiconductors and insulators is sim- 
ply the size of the gap; therefore we will not distinguish 
between the two below and will refer to any material with 
a gap as an insulator. 

Consider a Hubbard model at 'half-filling', i.e., with 
the same number of electrons as lattice sites. In order 
for a macroscopic current to flow, an electron must move 
from one lattice site (leaving an empty site with a net 
positive charge) to a distant site (creating a doubly oc- 
cupied site with a net negative charge) . The net charges 
may move through collective motions of the electrons. 
One could keep track of this by describing the movement 
of all of the electrons, but it is easier to introduce an 
equivalent description where we treat the net charges as 
particles moving in a neutral background. Therefore, we 
will refer to the positive charge as a holon and the neg- 
ative charge as a doublon. In the ground state of the 
valence bond theory all of the sites are neutral and there 
are no holons or doublons (cf. Eq. [52| ). However, it is 
reasonable to postulate that there are low lying excited 
states and hence thermal states that contain a few dou- 
blons and holons. These doublons and holons would in- 
teract via the Coulomb potential, V{r) = —e^/Kr, where 
K is the dielectric constant of the crystal. We know from 
the theory of the hydrogen atom (or, better, positron- 
ium, cf. Ref. !F| that this potential gives rise to bound 
states. Therefore one expects that, in the valence bond 
theory, holons and doublons are bound and separating 
holon-doublon pairs costs a significant amount of energy. 
Thus one expects the number of distant holon-doublon 
pairs to decrease as the temperature is lowered. There- 
fore, the valence bond theory predicts that the half-filled 
Hubbard model is an insulator. 

In contrast the molecular orbital theory has large num- 
bers of holons and doublons (cf. Eq. 51b which sug- 



LU 




gests that for an TV-site model there will be N/2 neu- 
tral sites, N/4, empty sites, and N/A doubly occupied 
sites). Mott reasoned that if there are many holon- 



0.2 0.4 0.6 



FIG. 6 Sketch of Mott's proposal for the energy of the Hub- 
bard model as a function of the number of holon-doublon 
pairs. Up, at low (zero) temperature(s) for large and small 
U/t. 



doublon pairs "it no longer follows that work must nec- 
essarily be done to form some more" . This is because the 
holon and doublon now interact via a screened potential, 
Vlr) = —{e^/nr) exp(— gr), where q is the Thomas-Fermi 
wavevector (cf. Ref. ITT]) . For sufficiently large q there 
will be no bound states and the molecular orbital theory 
predicts that the half-filled Hubbard model is metallic. 

Thus, Mott argued that their are two (local) minima of 
the free energy in a crystal (cf. Fig. [6]). One of the min- 
ima corresponds to a state with no holon-doublon pairs 
that is well approximated by a valence bond wavefunction 
and is now known as the Mott insulating state. The sec- 
ond minimum corresponds to a state with many doublon- 
holon pairs that is well approximated by a molecular or- 
bital wavefunction and is metallic. As we saw above, 
valence bond theory works well for U ^ t and molecular 
orbital theory works well for U <^ t. Therefore, in the 
half-filled Hubbard model we expect a Mott insulator for 
large U/t and a metal for small U /t. Further the 'double 
weir structure of the energy predicted by Mott's argu- 
ment (Fig. |6| suggests that there is a first order metal- 
insulator phase transition, known as the Mott transition. 
Mott predicted that this metal-insulator transition can 
be driven by applying pressure to a Mott insulator. This 
has now been observed in a number of systems; perhaps 
the purest examples are the organic charge transfer salts 
(BEDT-TTF)2X pS]). 

It is interesting to note that this infusion of chemical 
ideas into condensed matter physics has remained im- 
portant in studies of the Mott transition. Of particular 
note is Anderson's resonating valence bond theory of su- 
perconductivity in the high temperature superconductors 
PSI l?7|) . which describes superconductivity in a doped 
Mott insulator in terms of a generalisation of the valence 
bond theory discussed above. This theory can also be 
modified to describe superconductivity on the metallic 
side of the Mott transition for a half-filled lattice. This 
theory then provides a good description of the supercon- 
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FIG. 7 The dispersion relations for spin-up and spin-down 
electrons in the Hartree-Fock theory of the Hubbard chain 
(Stoner model of ferromagnetism) with m — 0.8t/U. 



ductivity observed in the (BEDT-TTF)2X salts (jMI- 

Note that theories, such as Hartree-Fock theory or den- 
sity functional theory ([^^ , that do not include the strong 
electronic correlations present in the Hubbard model do 



not predict a Mott insulating state. Thus weakly cor- 
related theories make the qualitatively incorrect predic- 
tion that materials such as NiO, V2O3, La2Cu04 and 
K-(BEDT-TTF)2Cu[N(CN)2]Cl are metals, whereas ex- 
perimentally all are insulators. 

We will discuss a quantitative theory of the Mott tran- 
sition is section HV.C. 21 



C. Mean-field theories for crystals 

1. Hartree-Fock theory of the Hubbard model: Stoner 
ferromagnetism 

In a similar manner to that in which we constructed the 
Hartree-Fock mean-field theory for the two site Hubbard 
model in section HV . A . II we can also construct a Hartree- 
Fock theory of the infinite lattice Hubbard model. Again, 
we simply replace the number operators in the two body 
term by their mean values, Uicr = {cl^Cia), plus the fluctu- 



ations about the mean, [cl^c. 
that are quadratic in the fluctuations, viz.. 



and neglect terms 



i 

~ u 



t 



(55) 



If we make the additional approximation that tii^ — ria- for all i, i.e., that the system is homogeneous and does not 
spontaneously break translational symmetry, we find that the Hartree-Fock Hamiltonian for the Hubbard model is 



(56) 



where N is the number of lattice sites and a is the opposite spin to a. It is convenient to write this Hamiltonian in 
terms of the total electron density, n = rhf + ni and the magnetisation density, m = — n^, which gives. 



n 



HF 



+U ^)4t^^t + + ™)C4C4 - |(n + m)(n - m) 



NU 



{n^ — Tin?) 



(57) 



where is the dispersion relation for U ~ Q and cr = 
±1 =ti. The last term is just a constant and will not 
concern us greatly. The penultimate term is the 'renor- 
malised' chemical potential, i.e., the chemical potential, 
^, of the system with [/ = is decreased by Un/2 due to 
the interactions. The first term is just the renormalised 
dispersion relation, in particular we find that if the mag- 



netisation density is non-zero the dispersion relation for 
spin-up electrons is different from that for spin-down elec- 
trons (cf. Fig. [7]). It is important to note that the 
Hartree-Fock approximation has reduced the problem to 
a single particle (single determinant) theory. Thus we 
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Guess value of m 



Calculate Dct(e) from 
EkCT for current m 



Calculate f(m] 



T 



Set m=(1 -a)m+0(f(m) 



Does f(m)=m to within the 
specified tolerance? 



No 



Yes 



m=f(m) is a self consistent solution 



FIG. 8 How to find the self consistent solution of Eq. |59[ If 
the convergence works well one can take a = 1, but for some 
problems convergence can be reached more reliably with a 
small value of a (often a value as small as ~ 0.05 is used). 



can write 



NU 



{n^-m^), (58) 



ko- 



where e^.^ = ^ \aUm and fi* — jj. — ^Un. 

We can now calculate the magnetisation density: 



f de[D^ie-^i*)-DUe-t^*)] 

J — OO 



de 



Do [e - ^Um+ ^Un - fi 



-Do [e+^Um+^Un- n 



f{m) = Do{0)Um + 0{m^), 



(59) 



where Do{e) = dNo{e)/de\f is the density of states (DOS; 
cf. Ref. [TT]) per spin for [/ = 0, No{e) is the num- 
ber of electrons (per spin species) for which < e for 
U = 0, D„{e) = dN„{e)/deU is the full interacting DOS 
for spin a electrons, and N^(e) is the number of elec- 
trons with spin a for which Eko- < £• The standard way 
to solve mean-field theories, known as the method of self 
consistent solution, is illustrated in Fig. [Sj The major 
difficulty with self consistent solutions is that it is not 
possible to establish whether or not one has found all of 
the self consistent solutions and therefore it is not possi- 
ble to establish whether or not one has found the global 
minimum. Therefore it is prudent to try a wide range 
of initial guesses for m (or whatever variable the initial 
guess is made in). 

Clearly m = is always a solution of Eq. [59j and 
for UDo{0) < 1 this turns out to be the only solution. 
But, for UDo{0) > 1 there are additional solutions with 
m ^ 0. This is easily understood from the sketch in 
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FIG. 9 Graphical solution of the self-consistency equation 
(Eq. 59 1 for the Stoner model of ferromagnetism. 



Fig. [9] Furthermore, the m solutions typically have 
lower energy than the m = solution and therefore for 
UDq{0) > 1 the ground state is ferromagnetic. UDo{0) > 
1 is known as the Stoner condition for ferromagnetism. 
In order for the Stoner condition to be satisfied a system 
must have narrow bands [small t, and hence large D{0)] 
and strong interactions (large U). 

There are three elemental ferromagnets, Fe, Co and Ni, 
each of which is also metallic. As the Hartree-Fock theory 
of the Hubbard model predicts metallic magnetism if the 
Stoner criterion is satisfied and these materials have nar- 
row bands of strongly interacting electrons it is natural to 
ask whether this is a good description of these materials. 
However, if one extends the above treatment to finite 
temperatures (|30|) one finds that the the Hartree-Fock 
theory of the Hubbard model does not provide a good 
theory of the three elemental magnets. The Curie tem- 
peratures, Tc, (i.e., the temperature at which the mate- 
rial becomes ferromagnetic) of Fe, Co and Ni are ~1000 
K (see, e.g., table 33.1 of Ref. ITT]). The Hartree-Fock 
theory predicts that ~ Urrio, where toq is the mag- 
netisation at T = 0. If the parameters in the Hubbard 
model are chosen so that Hartree-Fock theory reproduces 
the observed mp then the predicted critical temperature 
is ^10,000 K. This, order-of-magnitude, disagreement 
with experiment results from the failure of the mean-field 
Hartree-Fock approximation to properly account for the 
fiuctuations in the local magnetisation. This is closely 
related to the (incorrect) prediction of the Hartree-Fock 
approximation that there are no local moments above Tc- 
(Experimentally local moments are observed above Tc.) 
However, for weak ferromagnets, such as ZrZn2 {Tc ^ 30 
K) the Hartree-Fock theory of the Hubbard model pro- 
vides an excellent description of the observed behaviour 

The effects missed by Hartree-Fock theory are referred 
to as electronic correlations. The dramatic failure of 
Hartree-Fock theory in Fe, Co and Ni shows that elec- 
tron correlations are very important in these materials, 
as do other comparisons of theory and experiment 
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However, it is important to note that mean-field theory is 
not limited to Hartree-Fock theory (although the terms 
are often, but incorrectly, used synonymously). Rather 
Hartree-Fock theory is the mean-field theory of the elec- 
tronic density. By constructing mean-field theories of 
other properties it is possible to construct mean-field the- 
ories that capture (some) electronic correlations. We will 
now consider an example of a rather different mean-field 
theory. 

2. The Gutzwiller approximation, slave bosons & the 
Brinkman-Rice metal-insulator transition 

In 1963 Gutzwiller (!33) proposed a variational wave- 
function for the Hubbard model: 




where g = — ln(l — a) is a variational parameter and 
l^^o) is the ground state for uncorrelated electrons. One 
should note that the Gutzwiller wavefunction is closely 
related to the coupled cluster ansatz (T) , which is widely 
used in both physics and chemistry. Gutzwiller used 
this ansatz to study the problem of itinerant ferromag- 
netism. This leads to an improvement over the Hartree- 
Fock theory discussed above. However, in 1970 Brinkman 
and Rice p4|) showed that this wavefunction also de- 
scribes a metal-insulator transition, now referred to as 
a Brinkman-Rice transition. Rather than studying this 
wavefunction in detail we will instead use an equivalent 
technique known as 'slave bosons'. This has the advan- 
tage of making it clear that the Brinkman-Rice transition 
is just a mean-field description of the Mott transition. 

The site in a Hubbard model has four possible 
states: the site can be empty, |ej); contain a single spin 
a (=t or l) electron \ai); or two electrons, \di). The 
Kotliar-Ruckenstein slave boson technique introduces an 
over-complete description of these states: 

\e^) = e||0,), (61a) 
W^) = p14|0.), (61b) 

and 

where e|, pj^., and d| are bosonic creation operators 
which correspond to empty, partially filled, and doubly 
occupied sites. |0i) is a state with no fermions and no 
bosons on site i; note that this is not a physically real- 
isable state. This transformation is not only kosher, but 
also exact, so long as we also introduce the constraints 

^^Y^AaVio <^t'^i = 1, (62a) 



which ensures that there is exactly one boson per site and 
therefore that each site is either empty, partially occupied 
or doubly occupied, and 

cLcirr - v\aVia ^ = 0, (62b) 

which ensures that if a site contains a spin a electron 
then it is either singly occupied (with spin cr) or doubly 
occupied. 

Writing the Hubbard Hamiltonian in terms of the slave 
bosons yields 

■HHubbard = X! 4o-cLcjT%a + X! ^i^*' ^^^^ 

where Zj„ = ejpjv 

We now make a mean-field approximation and replace 
the bosonic operators by the expectation values: (e^) = e, 
(Pi't) — (Pii) — Pi {(^i) ~ d- Note that we have addition- 
ally assumed that the system is homogeneous (the ex- 
pectation values do not depend on i) and paramagnetic 
((Pit) — {Pil))- Therefore the constraints reduce to 

|e|2 + 2|p|2 + |d|2 = 1 (64a) 

and 

\p\' + \d\' = (cU.) = ^, (64b) 

where n is the average number of electrons per site. This 
amounts to only enforcing the constraints on average. 

This theory does not reproduce the correct result for 
U — 0. However, this deficiency can be fixed if Zja- is 
replaced by the 'renormalised' quantity, Zjc^, defined such 
that 

- — IdP / \ 
i^U.^r) = f^JJn [d + V^-n+\d\^) . (65) 

Let us specialise to a 'half filled' band, n = 1. The 
constraints now allow us to eliminate |pp = | — and 
|e|2 = |d|2. Thus we find that 

^Hubbard =^ -tJ2li\d\^-M^)clcja + UN„\d\'' 

= i(|d|2 - 2\d\^)Y^elh^, + UNo\d\', (66) 

where is the dispersion for U — and N is the num- 
ber of lattice sites. Recall that |dp = {d\di), i.e., |dp is 
the probability of site being doubly occupied. We con- 
struct a variational theory by ensuring that the energy is 
minimised with respect to which yields 

FiJ^ 1 

— = -(Ml - 4|dp) ^ e^(nk.) + 2UNo\d\ = 0. (67) 

Eq. |67] allows one to solve the problem self consistently, 
cf. Fig. [sj For small U this equation has more than one 
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minimum and the lowest energy state has |dp > 0, which 
corresponds to a correlated metalhc state (the details of 
this minimum depend on e^). But, above some critical 
U the ground state has |dp = 0, which corresponds to no 
doubly occupied states, i.e. the Mott insulator. Thus the 
dependence of the energy on the number holon-doublon 
pairs (rip = calculated from the mean-field slave bo- 
son theory is exactly as Mott predicted on rather general 
grounds (shown in Fig. rol). 



D. Exact solutions of the Hubbard model 

1. One dimension 

Lieb and Wu ([55]) famously solved the Hubbard chain 
at T = using the Bethe ansatz ([551 [37]). Lieb and 
Wu found that the half-filled Hubbard chain is a Mott 
insulator for any non-zero U . Nevertheless the Bethe 
ansatz solution is not straightforward to understand and 
weighty textbooks have been written on the subject 



[32]). 



2. Infinite dimensions: dynamical mean-field theory 

As one increases the dimension of a lattice the coor- 
dination number (the number of nearest neighbours for 
each lattice site) also increases. In infinite dimensions 
each lattice site has infinitely many nearest neighbours. 
For a classical model mean-field theory becomes exact in 
infinite dimensions, as the environment (the infinite num- 
ber of nearest neighbours) seen by each site is exactly the 
same as the mean-field. However, quantum mechanically 
things are complicated by the internal dynamics of the 
site. In the Hubbard model each site can contain zero, 
one or two electrons, and a dynamic equilibrium between 
the different charge and spin states is maintained. How- 
ever, the environment is still described by a mean-field, 
even though the dynamics are not. Therefore, although 
the Hartree-Fock theory of the Hubbard model does not 
become exact in infinite dimensions, it is possible to con- 
struct a theory that treats the on-site dynamics exactly 
and the spatial correlations at the mean-field level; this 
theory is known as dynamical mean-field theory (DMFT) 



The importance of DMFT is not in the, somewhat aca- 
demic, limit of infinite dimensions. Rather, DMFT has 
become an important approximate theory in the finite 
numbers of dimensions relevant to real materials (j38| . It 
has been found that DMFT captures a great deal of the 
physics of strongly correlated electrons. Typically the 
most important correlations are on-site and therefore are 
correctly described by DMFT. These include the corre- 
lations that are important in metallic magnetism (|39p 
and many other strongly correlated materials (|25l l38)) . 
Cluster extensions to DMFT, such as cellular dynami- 
cal mean-field theory (CDMFT) and the dynamical clus- 
ter approximation (DCA), which capture some of the 



non-local correlations, have led to further insights into 
strongly correlated materials ([40]) . Considerable success 
has also been achieved by combining DMFT with density 
functional theory ([^T|) . 



3. The Nagaoka point 

The Nagaoka point in the phase diagram of the Hub- 
bard model is the [/ — > oo limit when we add one hole to 
a half filled system. Nagaoka rigourously proved (42; 43]) 
that at this point the state which maximises the total 
spin of the system (i.e., the state with {Sz) = {N — l)/2, 
for an N site lattice) is an extremum in energy, i.e., either 
the ground state or the highest lying excited state. On 
most bipartite lattices (cf. Fig. 



11 1) one finds that this 
However, 



'Nagaoka state' is indeed the ground state (|43 
on frustrated lattices (cf. Fig. 11) the Nagaoka state is 



typically only the ground state for one sign of t ([H|) . 

It is quite straightforward to understand why the Na- 
gaoka state is often the ground state. As we are consid- 
ering the U ^ oo limit there will be strictly no double 
occupation of any sites. One therefore need only consider 
the subspace of states with no double occupation. As 
none of these states contain any potential energy (i.e., 
terms proportional to J7), the ground state will be the 
state that minimises the kinetic energy (the term pro- 
portional to t) . Clearly the ground state is the state that 
maximises the magnitude of the kinetic energy with a 
negative sign. In the Nagaoka state all of the electrons 
align, this means that the holon can hop unimpeded by 
the Pauli exclusion principle, thus maximising the mag- 
nitude of the kinetic energy. It is then a simple matter 
to check whether this is the ground state or the highest 
lying excited state as we just compare the energy of the 
Nagaoka state with that of any other state satisfying the 
constraint of no double occupation. 

Nagaoka's rigourous treatment has not been extended 
to doping by more than one hole and it remains an out- 
standing problem to further understand this interesting 
phenomenon, which shares important features with the 
magnetism observed in the elemental magnets p9p and 
many strongly correlated materials ([31|. 



V. THE HEISENBERG MODEL 

Like the Stoner ferromagnetism we discussed above in 
the context of the Hartree-Fock solution for the Hub- 



bard model (section IV.C.l ) and Hund's rules (which we 
will discuss in section VLB), the Heisenberg model is an 



important paradigm for understanding magnetism. The 
Heisenberg model does not provide a realistic descrip- 
tion of the three elemental ferromagnets (Fe, Co and 
Ni) as they are metals, whereas the Heisenberg model 
only describes insulators. However, as we will see in sec- 
tion [V]C] the Heisenberg model is a good description of 
Mott insulators such as La2Cu04 (the parent compound 
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of the high temperature superconductors) and k-(BEDT- 
TTF)2Cu[N(CN)2]Cl (the parent compound for the or- 
ganic superconductors). The Heisenberg model also plays 
an important role in the valence bond theory of the chem- 
ical bond (45). 

In the Heisenberg model one assumes that there is a 
single (unpaired) electron localised at each site, and that 
the charge cannot move. Therefore, the only degrees of 
freedom in the Heisenberg model are the spins of each 
site (the model can also be generalised to spin > 5 ) . The 
Hamiltonian for the Heisenberg model is 



^Heisenberg ^ ^ 



where = {Sf,Sf,Sf) = ^J2ai3 ^a^aP^ip is the spin 
operator on site i, a = {ax,cry,az) is the vector of Pauli 
matrices, and Jij is the 'exchange energy' between sites 
i and j. 



A. Two site model: classical solution 

In the classical Heisenberg model one replaces the spin 
operator. Si, with a classical spin, i.e., a real vector, S^. 
Thus on two sites, with J12 = J, the energy of the model 
is 



(2) 

Hciscnbc 



= JSi-S2 = J|Si||S2|cOS( 



(69) 



where (j) is the angle between the two spins (vectors). 
The classical energy is minimised by = tt for J > 
and = for J < 0. Thus for J > the lowest energy 
solution is for the two spins to point antiparallel (i.e., in 
opposite directions to one another); we will refer to this 
as the antiferromagnetic solution. For J < the lowest 
energy solution is for the two spins to point parallel to 
one another; we will refer to this as the ferromagnetic 
solution. Note that the difference in energy between the 
antiferromagnetic solution and the ferromagnetic solu- 
tion is 2JIS1IIS2I; so for S = |Si| = IS2I ^ 1/2 the 
energy difference is J/2. 



We now note that the Hilbert space of the two site 
Heisenberg model is spanned by four states (the spin on 
each site may be up or down; in general for an N site 
Heisenberg model the Hilbert space is 2^-dimensional). 
Further notice that the total spin of the model (S = 
Si + S2) commutes with the Hamiltonian, therefore the 
eigenstates will also be eigenstates of the total spin. Thus 
the four eigenstates must be a singlet, 



I*s) = ^(iti)i;2)-Ui)it2))^^(in) 



(68) 

and a triplet. 



m) = 



Iti)it2) = in) 
1 

^(in) + ut)) 



(|tl)U2) + Ul)|t2)) 



Ul)U2 



;t)), 

(72) 
(73a) 

(73b) 
(73c) 



It is now straightforward to calculate the total energy 
of the model for these states, 



= J(«',|Si-S2|* 

at I 



3J 



- {s+s- + srs:! 



SfS^ 



(1 n) - 1 ;t>) 

(74) 



and 



Et - J(*+|Si •82!^'+) 

= J(*?|Si.S2|*?) = 

'1 



j(vi/-|Si.s2|vi'r 



= J{U\ 



2 



srsi 



QZ QZ 



U) 



J 



(75) 



B. Two site model: exact quantum mechanical solution 

In order to solve the quantum mechanical version of 
the two site Heisenberg model it is useful to define the 
spin raising and lowering operators 



St ^ S't 
S- - — Sa 



(70a) 
(70b) 



Let us denote the state with spin up on site i as | ti) and 
the state with spin down on site i as | li). Therefore, 

^+1 t.) = 0, ^+1 U) = I t^), S-\ t^) = I and U 
) = 0. Further, it is straightforward to confirm that 



Si • S2 



2 y^^^j 



SZ QZ 
A O A 



(71) 



Thus we find that the singlet-triplet splitting for the 
quantum mechanical two site Heisenberg model is J. 



C. The Heisenberg model as an effective low-energy theory 
of the Hubbard model 

Consider a two site Hubbard model with two electrons 
and U ^ t, which is known as the atomic limit. U ^ t 
implies — )• in Eq. |54a| and that the ground state is 
the Heitler-London state, which is a singlet. The two 
other singlet eigenstates are the charge transfer states, 
which have energy ~ U and so will not participate in 
any low-energy processes, i.e., will not be involved in the 
interesting physics or chemistry. Therefore we can 'inte- 
grate out' the charge transfer states and derive a simpler 
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model with a smaller Hilbert space. A model derived in 
this manner is known as an 'effective low-energy Hamil- 
tonian' (see section VIII ) . In this case we will use second 
order perturbation theory to derive our effective effective 
low-energy Hamiltonian. 

We start by writing the two site Hubbard model as 



'^Hubbard ^ U [ Hq + Jj'Hl 



in) 



(76) 



Thus it 



where Ho = Z^i ^iT"4 '^nd Hi = ^ 
is clear that the small parameter for our perturbation the- 
ory is t/U. For t = the ground state is four-fold degen- 
erate; the four states involved being the Heitler-London 
state and the triplet states. Formally one should there- 
fore use degenerate perturbation theory. But, the per- 
turbation, 7^1, does not connect any of the four ground 
states, therefore, to second order (which is all we will 
consider), non-degenerate perturbation theory will yield 
the same results. As it simplifies the discussion, we will 
frame our discussion in terms of non-degenerate pertur- 
bation theory. It is left as an exercise to the reader to 
show that on adding the appropriate projection opera- 
tors (cf. Ref. |46|) to perform degenerate perturbation 
theory the result is unchanged. 

Consider the related Hamiltonian with "Hq ^ + 
'7|^hl)(^'hl| in the limit 77 — ^ 0~ (i.e., as 77 tends to 
from below) this has the same properties as Eq. 76 ex- 



cept that I^'hl) is the true ground state and we may use 
non-degenerate perturbation theory. (5'hlI'Hi I^'hl) = 
so there is no correction to the ground state energy to 
first order in t/U. The second order change in the ground 
state energy, AE'-^^ is given by 



(^'HLlHilV'a) 



Eg — i^HL 



(77) 



where the sum over s runs over all states except the 
ground state. Note that, as is true in general, the second 
order contribution to the ground state energy is negative. 
Evaluating the matrix elements (cf. Fig. 10 1 one finds 
that 



u 



(78) 



In contrast if we add an infinitesimal term to make one of 
the triplet states the true ground state, e.g. 0~|^'})(^i|, 
we find that the Pauli exclusion principle ensures that 
(^'} l^i IV's) = for all s. Thus there is no change in the 
energy of the triplet state to second order in t/U . 

Therefore it is clear that for U/t 00 the half- filled 
Hubbard model reduces to the Heisenberg model, i.e., 
the eigenstates and energies are the same, if we set 
J = 4|tp/J7. This result is not a special property of 
the two site model and is true to second order for an ar- 
bitrary lattice (j47p as second order perturbation theory 
only couples sites i and j if the hopping integral between 
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FIG. 10 Sketch of the superexchange processes that lead to 
the effective antiferromagnetic Heisenberg coupling between 
nearest neighbours in the large U/t hmit of the half-filled Hub- 
bard model. These processes lower the energy of the singlet 
state by At^ /U as their are four paths, the matrix element 
between the ground state and the intermediate states is ~t, 
and the intermediate states are higher in energy by U . The 
energy of the triplet states is unchanged to by perturbations 
to second order m t/U as the Pauli exclusion principle pre- 
vents two electrons with the same spin from occupying the 
same site. 



them is non-zero. For an arbitrary lattice to second order 

As the Heisenberg model is the large U/t limit of the 
half-filled Hubbard model electronic correlations are vi- 
tally important in the physics of materials described by 
the Heisenberg model. Therefore weakly correlated theo- 
ries, such as Hartree-Fock and density functional theory, 
give qualitatively incorrect results. 



D. Frustration: the solution of the three and four site 
classical Heisenberg models 

Before considering the classical three site model, let us 
spend a moment discussing the classical four site model. 
We assume that the four sites are situated on the ver- 
tices of a square and there is an exchange interaction J 
between nearest neighbours (i.e., along the sides of the 
square), but no interactions between next nearest neigh- 
bours (i.e., along diagonals of the square). The energy of 
the model is 



" 'cos%, (79) 



E 



where 9ij is the angle between the spins on sites i and j. 
Si is the spin on the i^^ lattice site and in the last equality 
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FIG. 11 Examples of classical spins on (a) a bipartite cluster 
and (b) a frustrated cluster. On the square cluster (a) one can 
arrange all of the spins antiferromagnetically, i.e., so that each 
spin is antiparallel to all of its nearest neighbours. The same 
is true for the square lattice. This cannot be accomplished on 
either the triangular cluster (b) or the triangular lattice. In 
each panel there is an exchange interaction, J, between any 
two spins joined by a line. Modified from Ref. (82] 



we have specialised to the case |Si| = 1/2 for alH. Notice 
that the Hamiltonian is just a sum over the 'bonds' (sides 
of the square). As for the two site model (section [V.A I, 
the solution depends on the sign of J. For J < the 
lowest energy state is ferromagnetic (all of the spins align 

parallel to one another) and has energy, E^^isenheYg ~ 
For J > the lowest energy state is antiferromagnetic 
(each spin aligns antiparallel to its nearest neighbour; 
see Fig. [TT^). Thus the four site cluster is split into two 
'sublattices' with all of the spins parallel to one another 
within the same sublattice and antiparallel to spins on 
the other sublattice. The antiferromagnetic arrangement 

of spins therefore has energy -Eniisenberg — Thus we 
find that the energy difference between the ferromagnetic 
and antiferromagnetic arrangements is 2 J. 



A lattice that can be split, as described above, into 
two sublattices such that all nearest neighbours are on 
different sublattices is referred to as bipartite. Both the 
four site (square) and two site lattices are bipartite. For 
bipartite lattices the energy difference between the ferro- 
magnetic and antiferromagnetic arrangements is JNz/A, 
where z is the coordination number of the lattice and N 
is the number of lattice sites. This is because the energy 
of each bond can be optimised regardless of what hap- 
pens to other bonds for either sign of J. It can be seen 



from Fig. 11 that the triangular lattice is not bipartite; 
this leads to significant differences in its physics. 

Before analysing this model mathematically let us con- 
sider some of those differences. Clearly for J < 
it is straightforward to arrange the spins ferromagneti- 
cally. Further, as the energy of each of the three bonds 
will be optimised in this arrangement we expect the to- 
tal energy of this state to be ^^Hcisonbcrg = 3J/4 for 
S — 1/2. But, as is shown in Fig. 11 d, one cannot 



arrange three spins antiferromagnetically on a triangular 
lattice. Thus, for J > 0, we cannot optimise the energy 
of each bond individually. When this is the case one says 
that the lattice is 'frustrated'. For a frustrated lattice 
with S = 1/2 we expect the solution for J > to have 



energy ^^fjcisonborg > —3 J/4 and thus one expects the 
difference in energy between this state and the ferromag- 
netic state to be < JNz/A. The concept of frustration 
can also be generalised to itinerant systems where a sim- 
ilar reduction in the bandwidth of the itinerant electrons 
is found (j44l) . 

Having outlined our expectations, let us now consider 
the three site Heisenberg model more carefully. The en- 
ergy is given by 

p(3) 



Heisenberg ~ 5Z ^' ' 



(80) 



Without loss of generality we can choose Si = 
5i(l,0,0); S2 = 52(cos02,sin02,O); and S3 
53 (cos 63 COS 03 , cos ^3 sin ^3 , sin ^3 ) . Thus, for Si = S2 = 
S3 = 1/2, 



E, 



(3) 

Hciscnbcrg 

J 

4 



(81) 



cos 02 + COS COS( 02 



COS 6^3 COS 03 



Physically we seek the minimum energy, which yields the 
conditions 



dE 



(3) 

Hciscnbcrg 



= — sm £^3 



COS(02 - 03) + COS 03 



dE, 



(3) 

Hciscnbc 



(903 



and 

Hciscnbcrg 



— cos 0-i 
4 ^ 



sm(02 



5) - sint 



cos 6'3 Sm(02 



3) + sin( 



= 0, 
(82a) 

= 
(82b) 

= 0. 

(82c) 



For J < the global minimum is, unsurprisingly, ^3 — 
02 = 03 = 0, i.e., ferromagnetism. The energy of the 
ferromagnetic state is 3J/4. For J > there are sev- 
eral degenerate minima, which all show the same physics. 
For simplicity we will just consider the minimum 63 — 0, 
02 — 27r/3, and 03 = 47r/3. In this solution each of 
the spins points 120° away from each of the other spins, 
hence this is known as the 120° state. It is left as an ex- 
ercise to the reader to identify the other solutions of Eqs. 
|82[ to show that there are none with lower energy than 
those discussed above, and to show that all of the degen- 
erate solutions are physically equivalent. The energy of 
the 120° state is —3 J/8 and hence the energy difference 
between the ferromagnetic state and 120° state is just 
9 J/8, i.e., less than we would expect {JNz/A = 3 J/2 for 
iV = 3, z = 2) for a bipartite lattice. 



E. Three site model: exact quantum mechanical solution 

Group theory, the mathematics of symmetry, allows 
one to solve the quantum spin 1/2 three site Heisen- 
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berg model straightforwardly. Unfortunately space does 
not permit an introduction to the relevant group theory. 
Therefore the reader who is not familiar with the math- 
ematics is advised either to refer to one of the many ex- 
cellent textbooks on the subject (e.g., Refs. [T^ [T31 or, 
failing that, to simply check that the wavefunctions de- 
rived by the group theoretic arguments below are indeed 
eigenstates. 

The Hamiltonian is 



Hciscnbcrg ~ 

"1 



2 \^t^'] +"^1 ^t] '^^t^] 



(83) 



We begin by noting that 2(g)2®2 = 2®2®4,^ i.e., a 
system formed from three spin 1/2 particles will have two 
doublets (with two- fold degenerate spin 1/2 eigenstates) 
and one quadruplet (with four- fold degenerate spin 3/2 
eigenstates) . 

There are only four possible quadruplet states consis- 
tent with the C3 point group symmetry^ of the model. 
Each of these belong to the A irreducible representation 
of C3. They are 



3/2 
3/2 



^3/2 

^3/2 



'3/2 
3/2 



int) 
1 

1 

i;;;> 



(i;n) + int) 
(im) + i;n) 



im)) 
i;;t)) 



where 1 01/37) = |5f , 5*1, 5*1) and a, /3 and 7 or |. Each 
of these states have energy E = 3J/4 and they are the 
(degenerate) ground states for J < 0. 

We are left with the four doublet states. These belong 
to the two dimensional E irreducible representation of C3 
and, as the Hamiltonian is time reversal symmetric, all 
four doublet states are degenerate. Explicitly the states 
are 



V^l/2 
V^l/2 

V^l/2 
^1/2 



1 

71 
1 

1 

71 
1 

71 



i27r/3 



int) 



i27r/3 



im) 



i27r/3 



m) 



i27r/3 



lUt) 



i;n) 
im) 

i;n) + e-*^'^/^ int) + e"^""^^ ittt) 
itu) + e-'^^/^ \m) + e''"/^ lut) 



* In this notation the integers are the degeneracy of the state. 

^ One might, reasonably, take the view that the model has either 
1)3/1 or Csv In fact the arguments in this section go through 
almost identically for either of these symmetries (with appropri- 
ate changes in notation) due to the homomorphisms from these 
groups to C3. We will use C3 notation for simplicity. 



Each of these states have energy E = —5 J/4 and they 
are the (degenerate) ground states for J > 0. Thus, 
the energy difference between the highest spin state and 
the lowest spin state is 2J. From the solution to the 
two site model (section |V.B ) we expected each of the 
three bonds to yield an energy difference of J between the 
lowest and highest spin states. Thus the frustration has 
a similar effect in both the quantum and classical models, 
i.e. frustration lowers the energy difference between the 
highest spin and lowest spin states. 



F. The Heisenberg model on infinite lattices 

The Heisenberg model can be solved exactly in one di- 
mension, and we will discuss this further below, but not 
in any other finite dimension. However, in more than 
one dimension physics of the Heisenberg model is typ- 
ically very different from that in one dimension, so we 
will begin by discussing, qualitatively, the semi-classical 
spin wave approximation for the Heisenberg model, which 
captures many important aspects of magnetism. A quan- 
titative formulation of this theory can be found in many 
textbooks, e.g. Refs. |TT] [301 

In inelastic neutron scattering experiments a neutron 
may have its spin flipped by its interaction with the mag- 
net; this causes a spin 1 excitation in the material. The 
conceptually simplest spin 1 excitation would be to flip 
one (spin- 1 ) spin; in a one dimensional ferromagnetic 
Heisenberg model this state has energy 2| J| greater than 
the ground state. However, a much lower energy excita- 
tion is a 'spin wave', where each spin is rotated a small 
amount from its nearest neighbours (cf. 



Fig. 12 



a one dimensional ferromagnetic Heisenberg mode spin 
waves have excitation energies of hojk = 2| J|[l — cos(fca)], 
where a is the lattice constant (|30|) . Note, in particular, 
that the excitation energy vanishes for long wavelength 
(small k) spin waves. This spin wave spectrum can in- 
deed be observed directly in neutron scattering exper- 
iments from suitable materials (jM]), and the spectrum 
is found to be in good agreement with the predictions of 
the semi-classical theory in many materials. One can also 
quantise the semi-classical theory by making a 'Holstein- 
PrimakofF' transformation ([30|) . This yields a descrip- 
tion of the low-energy physics of the Heisenberg model 
in terms of non-interacting bosons, which are known as 
'magnons' and have the same dispersion relation as the 
classical spin waves. Similar spin wave and magnon de- 
scriptions can be straightforwardly constructed for the 
antiferromagnetic Heisenberg model 

The effective low-energy physics of the one dimensional 
Heisenberg model is, as noted above, rather different 
from the semi-classical approximation. To understand 
this it is helpful to think of the Heisenberg model as a 
special case of the 'XXZ model': 



H 



XXZ 



Jxy 



I QX QX 



Jz 



7 , ^i+lJ 

(86) 
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FIG. 12 Sketches of 
same model. 



(a) the classical ground state of a ferromagnetic Heisenberg chain and (b) a spin wave excitation in the 



which reduces to the Heisenberg model for J^y = J^ = J . 
For Jz < Jxy < the model displays an exotic quantum 
phase known as a Luttinger liquid. (At Jxy = Jz the 
model undergoes a quantum phase transition from the 
Luttinger liquid to an ordered phase (fi^). 

On the energy scales relevant to chemistry one does not 
need to worry about the fact that protons and neutrons 
are made up of smaller particles (quarks). This is be- 
cause the quarks are confined within the proton/neutron 
()50p . Similarly, in a normal magnet it does not matter 
that the material is made up of spin-i particles (elec- 
trons). As described above, on the energy scales relevant 
to magnets the spins are confined into spin one particles 
called magnons. However, magnons can be described in 
terms of two spin-i spinons, which are confined inside 
the magnon. In the Luttinger liquid the spinons are de- 
confined, i.e., the spinons can move independently of one 
another (cf. Fig. 



13 ). As the magnon is a composite par- 



ticle made from two spinons this is often referred to as 
fractionalisation. A key prediction of this theory is that 
the spinons display a continuum of excitations in neutron 
scattering experiments (as opposed to the sharp disper- 
sion predicted for magnons). The two spinon continuum 
has indeed been observed in a number of quasi-one di- 
mensional materials. (|5ip 

An open research question is: does fractionalisation 
occur in higher dimensions? Because of the success of 
spin wave theory (which implies confined spinons) in 
describing magnetically ordered materials one does not 
expect fractionalisation in materials with magnetic or- 
der. Therefore, one would like to investigate quasi- two 
or three dimensional materials whose low-energy physics 
is described by spin Hamiltonians (such as the Heisen- 
berg model), but that do not order magnetically even 
at the lowest temperatures. Such materials are collec- 
tively referred to as spin liquids. There is a long his- 
tory of theoretical contemplation of spin liquids, which 
suggests that frustrated magnets and insulating systems 
near to the Mott transition are strong candidates to dis- 
play spin liquid physics. However, evidence for real ma- 
terials with spin liquid ground states has been scarce 
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FIG. 13 Sketch of spinons in a ID spin chain, (a) Local 
antiferromagnetic correlations, (b) A neutron scattering off 
the chain causes one spin (circled) to flip, (c, d) Spontaneous 
flips of adjacent pairs of spins due to quantum fluctuations 
allow the spinons (circled) to propagate independently. A 
key open question is: can this free propagation occur in 2D, 
or do interactions confine the spinons? Modified from Ref. 



until very recently, ([52]) but there is now evidence for 
spin liquids in the triangular lattice compound k-(BEDT- 
TTF)2Cu(CN)3 (EiKMl), the kagome lattice (cL Fig. ^ 
compound ZnCu3(OH)6Cl2 (J54') and the hyperkagome 
lattice compound Na4lr308 (55). It remains to be seen 
whether any of these materials support fractionalised ex- 
citations. 
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VI. OTHER EFFECTIVE LOW-ENERGY 
HAMILTONIANS FOR CORRELATED ELECTRONS 

A. Complete neglect of dlfFerentlal overlap, the 
Pariser-Parr-Pople model & extended Hubbard models 

We now consider another model for which the quan- 
tum chemistry and condensed matter physics communi- 
ties have different names. These models belong to class 
of models known as complete neglect of differential over- 
lap (CNDO). For a pair of orthogonal states, (j){x) and 
tpix), the integral over all space of the overlap of the 
two wavefunctions vansishes, i.e., 4>{x)^{x)dx = 0. 
If the differential overlap vanishes then the overlap of 
the two wavefunctions vanishes at every point in space, 
i.e., lima^o (t){x)i}{x)dx = for ah xq- The CNDO 

approximation is simply to assume that the differential 
overlap between all basis states is negligible. Thus CNDO 
implies that V^^fc/ = VukkSijSki (cf. section 
the general CNDO Hamiltonian is 



II.Bl. Thus 



n 



CNDO 



E 



(87) 



where Vij = Vujj 



and the number operator fii, 



The Pariser-Parr-Poplc (PPP) model is the CNDO ap- 
proximation in a basis that only includes the tt electrons. 
Often a Hiickel-like notation is used with Vij = "fij , thus 



ijaa' 



The extended Hubbard model, as with the plain Hub- 
bard model, is typically studied in a basis with one orbital 
per site. Further, one often makes the approximation 
that Vii = [/, Vij = V \i i and j are nearest neighbours 
and Vij = otherwise. This yields 

"HcH = - ^ U]c\^Cj„ + U^ni^fiii + V ^ fiiaUj^'. 

{ij)a i (■y)o-o"' 

(89) 

One can, of course, go beyond CNDO. The most gen- 
eral possible model for two identical sites with a single 
orbital per site is 

■HcH2 = - ^ - X {hi-s + n2a)] [c\^C2a + c\^Cia^ 
a 

-^U ^ fi^^fii:^ + Vnih2 + JSi • S2 

i 

+P (4t4i^2tC2; + 4T4;CitCu) , (90) 

where fii = J^a Si = J^afi cla^apCip, dap. Ss the vec- 
tor of Pauli matrices, J is the direct exchange interaction, 
X is the correlated hopping amplitude, and P is the pair 
hopping amplitude. 



B. Larger basis sets and Hund's rules 

Thus far we have focused mainly on models with one 
orbital per site. Often this is not appropriate, for exam- 
ple, if one were interested in chemical bonding or materi- 
als containing transition metals. Many of the models dis- 
cussed in these notes can be straightforwardly extended 
to include more than one orbital per site. However, while 
writing down models with more than one orbital per site 
is not difhcult, these models do contain significant addi- 
tional physics. Some of the most important effects are 
known as Hund's rules (T). These rules have important 
experimental consequences from atomic physics to biol- 
ogy. In order to examine Hund's rules let us consider the 
atomic limit (t = 0) of an extended Hubbard model with 
two electrons in two orbitals per site: 



'HcHls2o = ^ E ^Ait"-Mi 



Vonin2 + JffSi • S2, (91) 



1 or 2 labels the orbitals, n,, 



' = J2a0^iadai3Cf,i3, U is the Coulomb 



where fi 

repulsion between two electrons in the same orbital, Vq is 
the Coulomb repulsion between two electrons in different 
orbitals, and Jh is the 'Hund's rule coupling' between 
electrons in different orbitals. Notice that the Hund's 
rule coupling is an exchange interaction between orbitals. 
Further, if we compare the Hamiltonian with the defini- 
tion given in Eq. [2l]we find that 

-Jh = J rf'ri J d='r2 0*i(ri)02(ri) 

xV{ri - r2)02(r2)*0i(r2) 

^ Jdh, Jdh2 |0i(ri)|V(ri-r2)|</)2(r2)|' 

> 0. (92) 

as V^(ri— r2) is positive semidefinite. Therefore, typically, 
Jh < 0, i.e., the Hund's rule coupling favours the parallel 
alignment of the spins in a half-filled system. 

U is the largest energy scale in the problem, so, for sim- 
plicity, let us consider the case U 00. For Jh — there 
are four degenerate ground states: a singlet, ^(| ti 

) ~ I it)) (where the first arrow refers to the spin of 
the electron in orbital 1 and the second arrow refers to 
the spin in orbital 2), and a triplet, | ft); I ii) £^nd 
7f (I ti) - I it))- But, for J > the energy of the triplet 
states is Jh lower than that of the singlet state. Indeed 
spin symmetry implies that even if we relax the condition 
f/ — > 00 the triplet state remains lower in energy than the 
singlet state as physically we require U > Jh- One can 
repeat this argument for any number of electrons in any 
number of orbitals and one always finds that the highest 
spin state has the lowest energy. However, if one studies 
models with more than one site and moves away from 
the atomic limit {t = 0) one finds that there is a sub- 
tle competition between the kinetic (hopping) term and 
the Hund's rule coupling which means that the high spin 
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FIG. 14 Sketch of the toy model for a transition metal oxide, 
Hamihonian |94[ with two transition metal sites (1 and 2) and 
a single oxygen site (O). 



state is not always the lowest energy state. Many such 
interesting effects can be understood on the basis of a 
two site generalisation of this two orbital model ((56)) . 



C. The ionic Hubbard model 

Thus far we have assumed that all sites are identical. 
Of course, this is not always true in real materials. In 
a compound more than one species of atom may con- 
tribute to the low-energy physics (|57p or different atoms 
of the same species may be found at crystallographical 
distinct sites (|^ [55|l . A simple model that describes this 
situation is the ionic Hubbard model: 



(93) 



where = tu is the site energy, which will be taken 
to be different on different sites. Note that in the ionic 
Hubbard model all sites are assumed to have the same 
U. 

An important application of the ionic Hubbard model 
is in describing transition metal oxides f|S7i) . Typically 
Ci is larger on the transition metal site than on the oxy- 
gen site, therefore the oxygen orbitals are nearly filled. 
This means that there is a low hole density in the oxygen 
orbitals and, hence, that electronic correlations are less 
important for the electrons in the oxygen orbitals than 
for electrons in transition metal orbitals. If the difference 
between on the oxygen sites and e.i on the transition 
metal sites is large enough then the oxygen orbitals are 
completely filled in all low-energy states and therefore 
need not feature in the low-energy description of the ma- 
terial. However, just because the oxygen orbitals do not 
appear explicitly in the effective low-energy description 
of the material, does not mean that the oxygen does not 
have a profound effect on the low-energy physics. 

To see this consider a toy model with two metal sites 
(labelled 1 and 2) and one oxygen site (labelled O), whose 
Hamiltonian is 



E=Er 



E-Eo-A 




E-Ec 



FIG. 15 Sketch of the processes described by Hamiltonian |94| 
that give rise to the effective hopping integral between the 
two transition metal atom sites. 



as sketched in Fig. 14 which is just the ionic Hubbard 
model with U = Q and A = ei — ~ (-2 — > 0. 
With three electrons in the system and t = {] the ground 
state is four-fold degenerate, the ground states have two 
electrons on the O atom and the other electron on one 
of the metal atoms. If we now consider finite, but 
small, t ^ A we can construct a perturbation theory 
in t/A. One finds that there is a splitting between 
the bonding, 4=(c|g. -f C2o.)cq*Cq 1 |0) and antibonding. 



V2 

^(c|^ — C2cr)cQ.j.CQ^|0), states. The processes that lead 
to this splitting are sketched in Fig. [15) Therefore our 
effective low-energy Hamiltonian is a tight binding model 
involving just the metal atoms: 



(95) 



where, to second order in i/A, the effective mctal-to- 
metal hopping integral is given by 



t* 



A' 



(96) 



Note that, even though t is positive, t* < (or, equiva- 
lently, /3* > 0), in contrast to our naive expectation that 
hopping integrals are positive (/3 < 0; cf. section HI I. 



A 



(94) 



VII. THE HOLSTEIN MODEL 

So far we have assumed that the nuclei or ions form 
a passive background through which the electrons move. 
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However, in many situations this is not the case. Atoms 
move and these lattice/molecular vibrations interact with 
the electrons via the electron-phonon/vibronic interac- 
tion. One of the simplest models of such effects is 
the Holstein model, which we discuss below. Electron- 
vibration interactions play important roles across sci- 
ence. In physics electron-phonon interactions can give 
rise to superconductivity (59), spin and charge density 
waves polaron formation (pT|) and piezoelectricity 

(|59p . In chemistry vibronic interactions impact electron- 
transfer processes dHU), Jahn- Teller effects (pS]) . spec- 
troscopy stereochemistry (jMl) , activation of chem- 
ical reactions ((M)) and catalysis In biology the 
vibronic interactions play important roles in photopro- 
tection photosynthesis (jBS) and vision (EE). It is 
therefore clear that one of the central tasks for condensed 
matter theory and theoretical chemistry is to describe 
electron-vibration interactions. 

In general one may write the Hamiltonian of a system 
of electrons and nuclei as 



H — He + 'Hn + 'Hen, 



(97) 



where He contains those terms that only effect the elec- 
trons, Hn contains those terms that only effect the nuclei 
and Hen describes the interactions between the electrons 
and the nuclei. He might be any of the Hamiltonians we 
have discussed above. However, for the Holstein model 
one assumes a tight-binding form for He- In the normal 
mode approximation ()63|) . which we will make, one treats 
molecular/lattice vibrations as harmonic oscillators (cf. 
section [lI.A I. As the ions carry a charge, any displace- 
ment of the ions from their equilibrium positions will 
change the potential felt by the electrons. The Holstein 
model assumes that each vibrational mode is localised 
on a single site. For this to be the case the site must 
have some internal structure, i.e., the site cannot corre- 
spond to a single atom. Therefore the Holstein model is 
more appropriate for a molecular solids than for simple 
crystals. For small displacements, x^^, of the fj}^ mode 
of the i^^ lattice site we can perform a Taylor expansion 
in the dimensionless normal coordinate of the vibration, 
Qifi = x^f^ yjrni^j_ujif^/h where m^^ and are, respec- 
tively, the mass and the frequency of the jj}^ mode on 
the i**^ site, and we find that 



Hen — 



dQ 



(98) 



In the Holstein model one assumes that the derivative 
vanishes for i ^ j. We may quantise the vibrations in 



the usual way (cf. section II. A) which yields 



Hen — 9. 



(99) 



where a^j^'' destroys (creates) a quantised vibration in the 



/i*'^ mode on the i*^ site, gi^ 



and Hr, 



Eiu^vaLflv Thus 



^Holst 



em = XI ^i'^^J'^ + 

(ij)cr if^ 



(100) 



A. Two site Holstein model 

If we assume that there is only one electron and one 
mode per site then the Holstein model simplifies to 



Hho 



Istc 



huj a. 



(101) 



on two symmetric sites, where fii — ni„ = c|, 



It is useful to change the basis in which we consider 
the phonons to that of in phase (symmetric), s — 
(fli -|- a2)/-\/2, and out of phase (antisymmetric), h — 
(fli — 0,2) /V^, vibrations. In this basis one finds that 



^Ho 



Istcin 



^Hs+H 



be 



(102a) 



where 



and 



H, 



hujs'^s+ -^{s'^ + s){hi+n2) (102b) 
V2 



Hb 



Note that ni 



-t 



E (4. 



C2a + cLcicr 



-J^{b^+b)in,~n2). 



(102c) 



n2 — N, the total number of electrons 
in the problem. As TV is a constant of the motion the 
dynamics of the electrons cannot effect the symmetric 
vibrations and vice versa. Hence all of the interesting 
effects are contained in Hbe and we need only study this 
Hamiltonian below. 



1. Diabatic limit, fioj 3> t 

In the diabatic limit the vibrational modes are assumed 
to instantaneously adapt themselves to the particle's po- 
sition. Thus 



hujb'fb+ -^{b'f + b){ni-h2) 
v2 



fiijfe^&±-^(S^+6). (103) 



The plus sign is relevant when the electron is located on 
site 1 and the minus sign is relevant when the electron 
is on site 2. We now introduced the 'displaced oscillator 
transformation'. 



(104) 
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Therefore we find that 



C2cr + 4<jClCT 



2. Adiabatic limit, hui <^t 

We begin by noting that, as there is only one electron 
the spin of the electron only leads to a trivial two-fold 
degeneracy, and therefore can be neglected without loss 
of generality. A useful notational change is to introduce a 
pseudospin notation where we define Uz = c|g.cicr — C2o.C2ct 
It is important to note that the operators b+ and 6_ ^ c\„C2a+c\„ci„. Therefore the one electron, two 



(105) 



satisfy the same commutation relations as the h operator, 
therefore they describe bosonic excitations. We define 
the ground states of the displaced oscillators by 6_ |0_) = 
and 6 I 10+) = 0. Therefore 



and hence 



similarly 



&|0^ 



6_|0+) 



6+|0_) 



1 



_9_ 

/2 hhj 



0. 



V2g 



0+); 



V2g 



|0-), 



(106) 



(107a) 



(107b) 



i.e., |0±) is an eigenstate of 6zp with eigenvalue ^y/2g/huj. 
The eigenstates of bosonic annihilation operators are 
known as coherent states (f57|) . Eqs. 107 therefore show 
that the ground state of one of the b± operators may be 
written as a coherent state of the other operator (jBSj) . 



Therefore 



exp 



(0+I0-) 



exp 



(108) 



(109) 



which is known as the Franck-Condon factor. 

The Franck-Condon factor describes the fact that, in 
the diabatic limit, the bosons cause a 'drag' on the elec- 
tronic hopping. That is, we can describe the solution of 
the diabatic limit in terms of an effective two site tight 
binding model if we replace t by 



t* = t(0+|0_) = texp 



(110) 



Thus the hopping integral is 'renormalised' by the inter- 
actions of the electron with the vibrational modes (cf. 

This renormalisation is also found in the 



VIII) 



section 

solution of an electron moving on a lattice in the diabatic 
limit. In this context the exponential factor is known as 
the polaronic band narrowing (|61|) . The exponential fac- 
tor results from the small overlap of the two displaced 
operators, and may be thought of as an increase in the 
effective mass of the electron. 



2a'- 

site Holstein model Hamiltonian becomes 



hwb^b- 



b)az 



(111) 



which is often referred to as the spin-boson model. 

Let us now replace the bosonic operators by position 
and momentum operators for the harmonic oscillator de- 
fined as 



2mw 



St 



and 

Therefore 



^^'^{b^ 



! — muix -\- q 

2m 2 



-xa. 



(112a) 



(112b) 



(113) 



The adiabatic limit is characterised by a sluggish 
bosonic bath that responds only very slowly to the mo- 
tion of the electron, i.e., /2m -> 0, which it is often 
helpful to think of as the m — > oo limit. Further, in 
the adiabatic limit the Born-Oppenheimer approxima- 
tion "^S) holds, which implies that the total wavefunc- 
tion of the system, | ^P) , is a product of a electronic (pseu- 
dospin) wavefunction, \(l)e)^ and a vibrational (bosonic) 
wavefunction, IV'ti), i-e., I^*) — \4>e) ®\4'v) ■ Therefore, the 
harmonic oscillator will be in a position eigenstate and 
we may replace the position operator, x, by a classical 
position x, yielding 



iisb = ~ta^+ gJ ——xaz + -mujx'^ (114a) 




^(114b) 



where in the second line we have simply switched to the 
matrix representation of the Pauli matrices. This is easily 
solved and one finds that the eigenvalues are 



-mujx ± 
2 



t2 



muj 



g'^x^ 



1 



-mux ± 



mujg^x^ 
2ht 



(115a) 
(115b) 



where Eq. |115b| holds in the weak coupling limit, gx <C t. 
We plot the variation of these eigenvalues with x in this 
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FIG. 16 The energies of the ground and excited states for a 
single electron in the two site Holstein model in the adiabatic 



weak coupling limit {t ^ g ^ huj). Calculated from Eq. 115a 
X is the position of the harmonic oscillator describing out of 
phase vibrations. 



limit in Fig. [THj Notice that for the electronic ground 
state, -E-, the lowest energy states have x 0. This 
is an example of spontaneous symmetry breaking ()69p . 
as ground state of a system has a lower symmetry than 
the Hamiltonian of the system. Thus the system must 
"choose" either the left well or the right well (but not 
both) in order to minimise its energy. 



VIII. EFFECTIVE HAMILTONIAN OR SEMI-EMPIRICAL 
MODEL? 

The models discussed in these notes are generally 
known as semi-empirical models in a chemical context 
and as effective Hamiltonians in the physics community. 
Here the difference is not just nomenclature, but is also 
indicative of an important difference in the epistemolog- 
ical status awarded to these models by the two commu- 
nities. In this section I will describe two different atti- 
tudes towards semi-empirical models/effective Hamilto- 
nians and discuss the epistemological views embodied in 
the work of two of the greatest physicists of the twentieth 
century. 



A. The Diracian worldview 

Paul Dirac famously wrote (TU) that "the fundamental 
laws necessary for the mathematical treatment of a large 
part of physics and the whole of chemistry are thus com- 
pletely known, and the difficulty lies only in the fact that 
application of these laws leads to equations that are too 
complex to be solved." There is clearly a great deal of 
truth in the statement. In solid state physics and chem- 
istry we know that the Schrodinger equation provides an 
extraordinarily accurate description of the observed phe- 
nomena. Gravity, the weak and strong nuclear forces and 



relativistic corrections are typically unimportant, thus all 
of the interactions boil down to non-relativistic electro- 
magnetic effects. 

Dirac's worldview is realised in the ab initio approach 
to electronic structure. Wherein one starts from the 
Hartree-Fock solution to the full Schrodinger equation 
in some small basis set. One then adds in correlations 
via increasingly complex approximation schemes and in- 
creases the size of the basis set, in the hope that with a 
sufficiently large computer one will find an answer that 
is "sufficiently close" to the exact solution (full CI in an 
infinite complete basis set). 

In the last few decades rapid progress has been made 
in ab initio methods due to an exponential improvement 
in computing technology, methodological progress, and 
the widespread availability of implementations of these 
methods ([7l|l. However, this progress is unsustainable: 
the complexity recognised by Dirac eventually limits the 
accuracy possible from ab initio calculations. Indeed, 
solving the Hamiltonian given in Eq. 



17 is known to 



be computationally hard. Feynman proposed building a 
computer that uses the full power of quantum mechanics 
to carry out quantum simulations (|72p. Indeed, the sim- 
plest of all quantum chemical problems, the H2 molecule 
in a minimal basis set, has been solved on a prototype 
quantum computer (|73p . But, while even a rather small 
scale quantum computer [containing just a few hundred 
qubits ((7^ ] would provide a speed-up over classical com- 
putation, it is believed that the solution of Hamiltonian 
17 remains hard even on a quantum computer (i.e., it is 



believed that even a quantum computer could not solve 



Hamiltonian 17 in a time that grows only polynomially 
with the size of the system (f74|) '). Further, simple exten- 
sions of these arguments provide strong reasons to believe 
that there is no efficiently computable approximation to 
the exact functional in density functional theory (1741) . 
Therefore it appears that the equations will always re- 
main "too complex to be solved" directly. This suggests 
that semi-empirical models will always be required for 
large systems. 



B. The Wllsonian project 

Typically one is only interested in a few low-energy 
states of a system, perhaps the ground state and the first 
few excited states. Therefore, so long as our model gives 
the correct energies for these low-energy states we should 
regard it as successful. This, apparently simple, realisa- 
tion, particularly as embodied by Wilson's renormalisa- 
tion group (|75|) . has had profound implications through- 
out modern physics from high energy particle physics to 
condensed matter physics. 

The basic idea of renormalisation in remarkably sim- 
ple. Imagine starting with some system that has a large 
number of degrees of freedom. As we have noted, for 
practical purposes we only care about about the lowest 
energy states. Therefore one might be tempted to sim- 
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plify the description of the system by discarding the high- 
est energy states. However, simply discarding such states 
will cause a shift in the low-energy spectrum. Therefore, 
one must remove the high energy states that complicate 
the description and render the problem computationally 
intractable in such as way as to preserve the low-energy 
spectrum. This is often referred to as 'integrating out' 
the high energy degrees of freedom (because of the way 
this process is carried out in the path integral formulation 
of quantum mechanics (|76p ). Typically integrating out 
the high energy degrees of freedom causes the parame- 
ters of the Hamiltonian to 'flow' or 'run', i.e., change their 
values. When this happens one says that the parameters 
are renormalised. 

A simple example is the Coulomb interaction between 
the two electrons in a neutral Helium atom. For sim- 
plicity lets imagine trying to calculate just the ground 
state energy. We begin by analysing the problem in the 
absence of a Coulomb interaction between the two elec- 
trons. In the ground state both electrons occupy the Is 
orbital. We would like to work in as small a basis set as 
possible. The simplest approach is just to work in the 
minimal basis set, which, in this case, is just the two Is 
spin-orbitals, 0iso-(r). The total energy of a He atom ne- 
glecting the inter-electron Coulomb interaction is -108.8 
eV (relative to the completely ionised state). Now we re- 
store the Coulomb repulsion between electrons. A simple 
question is; how much does this change the total energy 
of the He atom? In the minimal basis set the solution 
seems straightforward: 



(Is^l^lls^) = 



^^\4>lst\^\c^lsi\ 

47reo|ri - 



34.0 eV. 



(116) 



Therefore it is tempting to conclude that we can model 
the He atom by a one site Hubbard model with U = 
(ls^|y|ls^). However, this yields a total energy for the 
He atom of -74.8 eV, which is not particularly close to 
the experimental value of -78.975 eV (8). 

Let us then continue to consider the problem in the 
basis set of the hydrogenic atom, which is complete due 
to the spherical symmetry of the Hamiltonian. One can 
now straightforwardly carry out a perturbation theory 
around the non-interacting electron solution where we 
take 



i=l 



2m 



and 



Hi = 



47reo|ri - 



(117a) 



(117b) 



A detailed description of this perturbation theory is given 
in chapter 18 of Ref. HI However, for our discussion, 
the key point is that, in this perturbation theory, the 
term (ls^|y|ls^) is simply the first order correction to 



the ground state energy. It is therefore clear why the 
minimal basis set gives such a poor result, it ignores all 
the higher order corrections to the total energy. 

The failure of the simple minimal basis set calcula- 
tion does not, however, mean that the effective Hamil- 
tonian approach also fails, despite the fact that the ef- 
fective Hamiltonian is also in an extremely small basis 
set. Rather, one must realise that, as well as the first 
order contributions, U also contains contributions from 
higher orders in perturbation theory. It is therefore pos- 
sible, although extremely computationally demanding, to 
calculate the parameters for effective Hamiltonians from 
this kind of perturbation theory (|77|) . 

A more promising approach, which has been applied to 
a number of molecular crystals ([78l [79)) . is to use atom- 
istic calculations to parameterise an effective Hamilto- 
nian. For example, density functional theory gives quite 
reasonable values for the total energy of the ground state 
of many molecules. Therefore one approach to calcu- 
lating the Hubbard U is to calculate the ionisation en- 
ergy, / = Eo{N ~ 1) — Eo{N), and the electron affin- 
ity, A = Eo{N) - Eo{N + 1), of the molecule, where 
Eq^h) is the ground state energy of the molecule when 
it contains n electrons and N is the filling correspond- 
ing to half-filled band. One finds that U = I — A — 
Eq{N + 1) + Ea{N - 1) - 2Eo{N). A simple way to see 
this is that if we assume the molecule is neutral when 
it contains N electrons then U corresponds to the en- 
ergy difference in the charge disproportionation reaction 
2M ^ Af + -|- M~ for two well separated molecules, M. 
A more extensive discussion of this approach is given in 
Ref. ESI 

It is worth noting that we have actually carried out this 
program of parametrising effective Hamiltonians three 
times in the discussion above. In section IV. CI we showed 
that the Heisenberg model is an effective low-energy 
model for the half-filled Hubbard model in the limit 
t/U — > 0. In section VI. C we derived an effective tight 



binding model that involved only the metal sites from an 
ionic Hubbard model of a transition metal oxide. Finally, 
in section IVII.A.ll we showed that vibronic interactions 
lead to an effective tight binding model describing the 
low-energy physics of the Holstein model in the diabatic 
limit, and that in this model the quasiparticles (electron- 
like excitations) are polarons, a bound state of electrons 
and vibrational excitations with a mass enhanced over 
that of the bare electron. 

However, to date, the most important method for 
parametrising effective Hamiltonians has been to fit the 
parameters to a range of experimental data, whence the 
name 'semi-empirical'. Of course experimental data con- 
tains all corrections to all orders therefore this is indeed 
an extremely sensible thing to do. But, it is important 
to understand that empiricism is not a dirty word. In- 
deed empiricism is what distinguishes science from other 
belief systems. Further, this empirical approach is ex- 
actly the approach that the mathematics tells one to take. 
It is also important to know that no quantum chemical 
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or solid state calculation is truly ab initio - the nuclear 
and electronic masses and the charge on the electron are 
all measured rather than calculated. Indeed the modern 
view of the 'standard model' of particle physics is that 
it too is an effective low-energy model (50). For exam- 
ple, in quantum electrodynamics (QED), the quantum 
field theory of light and matter, the bare charge on the 
electron is, for all practical purposes, infinite. But, the 
charge is renormalised to the value seen experimentally 
in a manner analogous to the renormalisation of the Hub- 
bard U of He discussed above. Therefore, as we do not, 
at the time of writing, know the correct mathematical 
description of processes at higher energies, all of theoret- 
ical science should, perhaps, be viewed as the study of 
semi-empirical effective low-energy Hamiltonians (80). 

Finally, the most important point about effective 
Hamiltonians is that they promote understanding. Ulti- 
mately the point of science is to understand the phenom- 
ena we observe in the world around us. While the ability 
to perform accurate numerically calculations is impor- 
tant, we should not allow this to become our main goal. 
The models discussed above provide important insights 
into the chemical bond, magnetism, polarons, the Mott 
transition, electronic correlations, the failure of mean 
field theories, etc. All of these effects are much more diffi- 
cult to understand simply on the basis of atomistic calcu- 
lations. Further, many important effects seen in crystals, 
such as the Mott insulator phase, are not found methods 
such as density functional theory or Hartree-Fock theory, 
while post Hartree-Fock methods are not practical in in- 
finite systems. Thus effective Hamiltonians have a vital 
role to play in developing the new concepts that are re- 
quired to understand the emergent phenomena found in 
molecules and solids ((5T|) . 
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